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ON DOMINANCE RELATIONS AND THE STRUCTURE OF 
ANIMAL SOCIETIES: II. 


SOME EFFECTS OF POSSIBLE SOCIAL FACTORS 


H. G. LANDAU 


COMMITTEE ON MATHEMATICAL BIOLOGY 
THE UNIVERSITY OF CHICAGO 


In a previous paper (Landau, 1951) it was shown that a society 
with a dominance relation would rarely tend to be close to the hierarchy 
in structure if dominance is determined solely by the inherent character- 
istics of the members. Here we consider the effects of other factors, 
due to social rank or to the outcome of previous encounters which af- 
fected dominance. 

The following results are obtained. A uniform bias against rever- 
sal of dominance will have no effect on the stationary distribution of the 
structure of the society. If the probability of dominance is a linear 
function of the previously established score (number of members domi- 
nated), there will be a small tendency for the society to move toward 
the hierarchy; but this is negligible for large societies. If a member 
never. challenges another whose score exceeds his own by two or more, 
or if he can never dominate if he should challenge, then the hierarchy 
is the only stable structure. 

From the last result it is concluded that social factors which restrict 
challenges or the probability of dominance could easily account for so- 
cieties close to the hierarchy, such as are observed in flocks of domestic 


hens. 
The effectiveness of social bias in establishing hierarchies is much 


greater in small societies than in large ones. 


1. INTRODUCTION. This paper continues the study of the struc- 
ture of societies with a dominance relation between every pair of 
members. Familiarity with the previous paper (Landau, 1951, re- 
ferred to as 1) is assumed. 

One of the principal conclusions of I was that if dominance is 
determined solely by the inherent characteristics of the members, 
then a society with a dominance relation would rarely be close to the 
hierarchy. By inherent characteristics we mean all those which are 
not affected by the social status, i.e., the dominance relations, of any 
of the members or by the results of any previous encounters in which 
dominance relations were affected. Since, however, societies which 
are hierarchical in structure, except perhaps for a small number of 
triangles* (j > k, k > 1,1 > j) are known to occur, particularly 


*A list of symbols is given on page 261, 
245 
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among domestic hens, it is important to consider whether social fac- 
tors could produce hierarchies. By social factors we mean all those 
tactors affecting dominance which are not due to inherent charac- 
teristics of the members; i.e., social factors are those due to the exist- 
ing or previous social structure, or dominance relations, or to the 
outcome of any previous encounters affecting dominance. 

This paper considers the mathematical aspects of some possible 
social factors. Certain assumptions are made as to the operation of 
factors which depend on the previous history of the society and its 
members, and conclusions as to the effect on the structure of the 
society are derived. A detailed discussion of the mechanism of such 
factors in social, psychological or physiological terms is not at- 
tempted. Thus, whether a particular factor could be ascribed to so- 
cial lag, conditioning or hormone concentration will not be discussed. 

The essential difference between the present considerations and 
those of I is as follows. In I, the picture was that of the m members 
of the society coming together and engaging in N = n(n — 1)/2 con- 
tests—one for each pair, or a single round robin—with the result of 
each contest fixing the direction of dominance for the pair involved. 
The outcome of each contest was completely independent of that of 
any other contest, but depended only on_inherent characteristics of 
the members. Our present picture is that the N dominance rela- 
tions have been determined by some process, and that thereafter con- 
tests (or encounters) occur between some of the pairs. Only one 
contest is supposed to occur at a time; and we consider various as- 
sumptions about the way in which the probabilities of contests or of 
their outcomes may depend on the previous dominance relations. Each 
such contest may change the structure of the society, and we wish 
to be able to draw conclusions about the way in which the structure 
will vary.* Considerations of this kind were introduced by A. Rapo- 
port (1949, 1950), who calculated some effects of social factors in 
societies with three members. 

A change of structure can occur, as a result of the contests we 
are considering, only if the dominance relation is reversed, but re- 
versal of a dominance relation does not necessarily result in a differ- 
ent structure. This point will, of course, be considered in detail in 


*Other possible treatments might be to consider the development of the 
structure from an initial state when no dominance relations had yet been estab- 
lished, through the successive stages when 1, 2, ---- , N dominance relations 
have been established. Or again, we might consider what happens when a new 
member is introduced into a society with an established structure, and thus build 


up our society by successively adding new members. These approaches will not 
be treated here. 
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connection with the different factors which are assumed. The kinds 
ot tactors to be considered here include bias against reversal of domi- 
nance, the probability of dominance being dependent on previous 
score, and the probability of the subordinate member of a pair chal- 
lenging the dominant one being dependent on previous score. 

In I, we considered the society to be a random sample of n indi- 
viduals from a population with a given distribution of inherent char- 
acteristics. Here we do not consider the society to be a random sam- 
ple, but are instead looking at a society of n fixed individuals and 
wish to determine how the social structure changes with time. In 
some cases it is also assumed that the individuals are the same with 
respect to inherent characteristics. 

We are interested in determining whether the effect of social 
factors is to lead to an ultimate stable structure for the society, and, 
if so, what kind of structure. In general there is no single stable 
structure, and we would like to be able to characterize the distribu- 
tion of structures through which the society passes as contests go on. 

2. MATHEMATICAL METHOD. The effect of social factors can- 
not be treated by the mathematical technique used in I, because the 
probability of dominance now depends on existing dominance rela- 
tions. The mathematical procedure for handling problems of this 
kind is the theory of Markov chains. An excellent introduction to 
this theory is given in W. Feller (1950), and a more detailed ac- 
count can be found in M. Fréchet (1938). For terminology and re- 
sults not proven here we refer to Feller. 

The theory of Markov chains* is concerned with a system which 
can exist in several states, H,, H., H;,----. If the system is in state 
E., the probability that it will change to state H, is given by the 
transition probability dag. 

In the application to societies with dominance relations, the 
states E. are the possible structures of the society. The transition 
probabilities aag then give the probability that the structure will 
change from EF. to Eg as the result of an encounter. 

In applying the Markov chain theory we are faced by the diffi- 
culty that the measure of structure which was used in I, the hier- 
archy index h, cannot be used to define the H.. This is because, as 
was pointed out in I, more than one dominance structure can have the 
same score structure and hence the same value of h, but the deter- 
mination of the transition probabilities will, in general, require the 


*Only finite chains are needed here, and it is assumed that the transition 
probabilities are constant. 
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knowledge of dominance structure. Thus in the exampie given in I 
(p. 3), if in the first matrix the dominance relation 5 > 1 is re- 
versed to 1 > 5, we obtain a structure with V = (4,2,2,2,0), but 


this score structure cannot be obtained from the second matrix by any - 


single reversal of a dominance relation. 

The fact that 2 alone does not suffice for the treatment oi social 
factors means that our considerations in the following are more in- 
volved than in I. However, h is still a useful measure as will be 
seen below. 

First we wish to show that under certain conditions a stationary 


distribution of structures independent of the initial structure will — 


exist. A simple sufficient (but not necessary) condition is that all 
€jxDjx > 0, Where ej, is the probability of an encounter between mem- 
bers j and k, and pj, is the probability that 7 > k after the encounter. 
This condition implies that in any structure the probability that any 
dominance relation be either reversed or maintained is not zero. We 
can now refer to Feller (1950; Theorem 2, p. 325). The non-zero 
probability of maintaining any dominance relation means that 
Gac > 0, hence the chain is aperiodic. Also since any reversal of 
dominance relations has a non-zero probability, it is possible to go 
from any structure to any other by a finite number of one-step transi- 
tions, so that the chain is irreducible. From the theorem referred to, 
it then follows that there is a stationary probability distribution, 
Ua , probability of the structure E., satisfying 


Up 20, te—1, (1) 


and 
Ua = > Up Aga . (2) 
8 


This theorem was proven by Rapoport (1949) without referring to 
Markov chains. 

The interpretation of this stationary distribution of structures 
is as follows. As encounters occur which may change dominance re- 
lations, the structure of the society changes going from Ea, to Ea " 
Eu; , etc. The distribution 222 then gives the distribution of the struc- 
tures which the society takes on during the course of time. If we as- 
sume that encounters occur at equal intervals of time (since we are 
not concerned here with the actual time rate of encounters), then we 


gives the fraction of the time during which the society has the struc- 
ture E.. 
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This conclusion can be related to the work in I. In | we consid- 
ered societies of n individuals drawn from a population with a given 
distribution of inherent characteristics which determined dominance 
probability. These individuals engaged in a single round robin. The 
resulting distribution of structures depended on two sets of random 
factors: (1) the inherent characteristics of the particular n indi- 
viduals, and (2) the outcome of the contests. We now omit the first 
set of random factors, i.e., we now consider a society of n fixed in- 
dividuals with inherent characteristics determining dominance prob- 
ability; then the distribution due to the varying possible outcome 
of the contests is exactly the same as would be given by the distribu- 
tion Ua above, by using the Markov chain theory. This is to be ex- 
pected from the uniqueness of the stationary distribution for finite, 
aperiodic irreducible chains (Feller, 1950, p. 329), and an explicit 
proof is given in the Appendix (Theorem Al). This property is re- 
ferred to as the ergodic property of such chains. To restate: If domi- 
nance is affected only by inherent characteristics then a society whose 
structure changes due to continued encounters will in the course of 
time pass through the same distribution of structures as would result 
from the possible outcomes of round robin tournaments for that so- 
ciety. 

In this connection, it is worth stating the formula for E(h) for 
a society of n fixed individuals with inherent characteristics deter- 
mining dominance probability. This was not explicitly stated in I, 
but follows directly from (7) and (15) there, which give 


3 
E (h) =— iecr ey su . (3) 
TL n (ee) tag, 
jz 

3. Bias AGAINST REVERSAL OF DOMINANCE. The first social fac- 
tor which we consider is that of bias against reversal of dominance. 
This could be interpreted as some sort of conservative factor or ten- 
dency to maintain existing dominance relations. 

We consider a society of » individuals with inherent character- 
istics which give the dominance probabilities (0), the probability 
that 7 > k, before any encounters have occurred. We now assume 
that if actually 7 > &, then the probability of reversal of dominance 


at the next encounter is 
Dui = (1 — €) Dx; (0), (4) 


with 
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0-6 <1. 


This says that dominance once having been established, the probabil- 
ity of reversal is reduced compared to the original probability of the 
reversed direction of dominance. We are also making here the very 
restrictive assumption that the factor 1 — ¢, by which the probabil- 
ity of reversal is reduced, is the same for all pairs, i.e., the bias is 
uniform. 

It is shown in the Appendix (Theorem A2) that under these as- 
sumptions the stationary distribution, ua, is independent of the bias 
against reversal of dominance, «. . 

This result is a little surprising, especially when we notice that 
the excluded case, « = 1, means that no reversal is possible so that 
the structure can never change. But Theorem A2 states that any uni- 
form bias, no matter how large as long as it permits a non-zero prob- 
ability of reversals, has no effect on the ultimate distribution of struc- 
tures. Of course, a large « means that the structure is less likely to 
change. It is more likely to remain as it is than shift to a new struc- 
ture,* but as long as « < 1 there is a.positive probability of shifting, 
and once shifted it again tends to remain in the new structure. How- 
ever, in the long run the fraction of time spent in each structure is 
the same as if there were no bias against reversal. 

Of course, the assumption that the bias against reversal of domi- 
nance is uniform is much more special than we would like. Any so- 
cially conservative factors that operated against reversal of domi- 
nance would probably also depend on other factors, such as the rela- 
tive social rank of the pair of individuals concerned. However, it has 
not as yet been possible to carry through the treatment of bias against 
reversal on more general assumptions except for the case treated in 
Section 5. For very small societies the complete calculations can be 
made in detail, and this has been done by Rapoport (1950) forn = 8. 

4. DOMINANCE PROBABILITY A LINEAR FUNCTION OF SCORE DIF- 
FERENCE. Another possibility for the manner in which a social or psy- 
chological factor might operate is through affecting the dominance 
probability according to the scores of the individuals. The score of 
an individual is, of course, a measure of his social rank. 

In the case of a flock of hens it is not reasonable to assume that 
each hen counts the number of hens pecked by each of her flock-mates, 
compares this with her own score, and acts accordingly. However, 
she can be expected to observe‘roughly relatively how often a particu- 


neat Bate tendency to persist in one state could be measured by the serial corre- 
lation coefficient of h. This would increase with increasing «&. 
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lar hen pecks or is pecked, and that her behavior will be influenced by 
this rough estimate of social rank. This seems to be especially true 
of the behavior toward those hens who are near the top and also those 
near the bottom. 


As a first case, let us assume that the probability of dominance at 
any time is a linear function of the score difference of the pair in- 
volved, i.e., we take as the probability that j > k at time t to be 
given by 

Piet | Lt w(v;-— Ux) ], (5) 


where w is a constant and v; is the score from the preceding encoun- 
ters. The dominance probability is here assumed to depend only on 
score difference and not on any other factors, either inherent or so- 
cial. We also assume that the probability of encounter is the same 
for all pairs. 

On this basis, the expected value of the hierarchy index, EH (h), 
can be calculated. Here, since the structure of the society changes 
with time, E'(h) is the average value of h over a long period. It is 
shown in the Appendix that 


H 
TE) ree ee, (6) 
1—w(n—2)/2 
where H = 3/(n + 1) is E(h) for ow = 0. The case w = 0 is that 
of no dominance bias, and the present formula, of course, gives the 
same result as found in I, equation (49). 
In interpreting (6) it must be remembered that p;, in (5) is a 
probability, so p;, < 1; and since v; — v; can be as large as n — 1, We 
must have 


1 


n—1 


(7) 


os 


We also assume w > 0. For negative w see Section 6. 

For small n, when w has the maximum value given by (7), the 
structure of the society will be close to the hierarchy: for n = 85 
E(h) =1and for n= 5, E(h) = 4/5. But for large n, we have 


E(hy = 6/n, (8) 


so that the society will be near equality Tore of the time than it is 


near the hierarchy. 
It is not hard to see the reason for this result for large n. The 


limitation w < 1/(n — 1), made necessary by the assumption of lin- 
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earity, means that for large n, the bias in favor of the higher rank- 
ing member of any pair will not be appreciable until the difference in 
score becomes relatively large. As 7 increases, the proportion of pairs 
for which the score difference is large will decrease, so that the ef- 
fect of the linear bias decreases. 

5. BIASES WHICH MAKE THE HIERARCHY CERTAIN. It was sug- 
gested by Rapoport (1950) that if encounters and reversals are con- 
fined to individuals of equal social rank, then the hierarchy would 
result. This would bea very severe restriction, and what we now show 
is that the hierarchy is the only stable structure under rather less se- 
vere restrictions on possible encounters and reversals of dominance. 
Theorem A4 of the Appendix states that if the probability that 7 > k 
is equal to 1 whenever v; => v, + 2, then the hierarchy is the only 
stable structure. This restriction is equivalent to saying that re- 
versal of dominance is impossible if the subordinate member’s score 
is less than that of the dominant member of the pair by two or more. 
Still another statement of the same result would be that the struc- 
ture is certain to become the hierarchy, if a member never challenges 
another whose score exceeds his own by two or more. 


For n = 3, the present extreme bias is the same as the maximum 
linear bias, so that we have another way of seeing why the hierarchy 
is the only stable state in this case. 


It should be noted that the hierarchy we are considering here is 
not stable in the sense of the individual members maintaining their 
relative positions. Instead, it would be quite possible for individuals 


EXTREME BIAS 


eee 


NO BIAS 
-(n-!) 4{n-2) n-2 Nn-i 


_FIGURE 1. The relation between the biases which increase dominance prob- 
ability according to score difference. Note that for the extreme bias, p;, is un- 
specified for |v; — | <1. 
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in adjacent ranks to be constantly interchanging positions; but these 
are the only interchanges which would be possible and they do .not 
alter the hierarchical structure. The zoologist studying a flock of hens 
would probably want to define stability in terms of the position of the 
individual hens remaining unchanged. Our result does not refer to 
stability in this sense, but it can be pointed out that if in the above 
theorem we replace the score difference of two by one, we get this 
kind of stability. This means that reversals can occur only between 
individuals with the same score. Once the hierarchy is attained, there 
could be no further change of dominance relations. 


6. GENERAL REMARKS. We have been looking for social biases 
that would give near-hierarchical structures. Figure 1 illustrates the 
cases treated of a bias that depends on score difference. For no bias, 
that is, pj, —4—=0, and even for the maximum linear bias, the hier- 
archy will be rare except in very small societies. The extreme bias 
just treated does make the hierarchy a certainty, but is probably too 
restrictive to be realistic. It would seem likely that biases lying be- 
tween the linear one and the extreme would make the hierarchy more 
likely, even for large n as the bias approached the extreme case. It 
has not yet been possible to carry out the analysis of the effect of 
such biases. 

However, we can conclude that a social bias which makes the 
preservation of dominance more likely the greater the difference in 
social rank (as measured by score difference) could account for the 
prevalence of near-hierarchical structures in flocks of hens. Also, 
the larger the society the stronger does the bias need to be to estab- 
lish the hierarchy; or, in other words, equal biases will be less effec- 
tive in making for the hierarchy in a large compared to a small so- 
ciety. This statement is, of course, not exact because we have given 
no measure of the strength of a social bias. It does agree with the 
observed fact that very large hen flocks are less likely to have the 
hierarchical structure. 

So far we have mainly been trying to account for the hierarchy, 
and have implicitly assumed that all biases were in this direction. It 
seemed, at first, unreasonable to consider negative values of e or w 
or reversal of the inequality in the last section. However, it occurs 
to the writer that this may be a bias in his own thinking induced by 
the culture pattern of his own society. For example, R. Benedict 
(1934) describes the socially enforced habit of cooperation among 
the Zufi. Perhaps biases favoring the subordinate member should 
be given equal consideration, if only to indicate what might occur in 
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a cooperative society. There is no difference in the mathematics, only 
changes of sign are needed. 

Thus in Section 3, the uniform bias « could be negative [so long 
as (1 — «)px;(0) < 1] and then favor reversal of dominance. As be- 
fore, this would have no effect on the stationary distribution, but now 
instead of tending to persist in its existing structure the society 
would tend to shift away from the existing structure. 

Making w negative would tend to move any society, even small 
ones, toward equality. For # = — 1/(n — 1) (its largest negative 
value), H(h) = 2/n for large n. 

Reversing the inequality in Section 5 can be easily seen to re- 
sult in h having its lowest possible value; i.e., zero for n odd or 
3/(n? — 1) for n even. The only possible structures are the equality, 
or a structure as close to it as possible for n even. 


APPENDIX 


The dominance structure was defined in I as a set of matrices 
such that any two matrices of the set could be transformed into each 
other by a permutation of rows and corresponding columns. For the 
application of Markov chain theory it is more convenient to take as 
the individual states not the dominance structure but the individual 
matrices used to define it, i.e., each state is a skew-symmetric matrix, 
Ma. with the elements mz,;, such that 


Majze—1 if 7>k. 


Taking these matrices to be our states amounts to considering two 
states as distinct when different individuals occupy corresponding 
positions in the two, even though one matrix can be transformed into 
the other by a row and column permutation; i.e., renaming the indi- 
viduals. The states thus defined form a finite Markov chain with con- 
stant transition probabilities, and the resulting stationary distribu- 
tions can be stated in terms of the distribution of dominance struc- 
tures by combining the probabilities, ua, for those states which be- 
long to the same dominance structure. 

The advantage of using these matrices for the states is immedi- 
ately apparent. Any reversal of a dominance relation will now carry 
the state M. into a different state. There will be exactly N such 
states, which we will denote by M.. Thus the transition probabilities 
Gag (and aga) are in every case equal to zero except for f=aor 
one of the N values of a’. Also each such transition, Mz > Ma or 
Ma ~ Ma, is associated with the reversal of dominance between a 
particular pair of individuals, which we denote by ja and ka: . 
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The total number of states is 2”. 

The theorem of Section 2 can now be proven as follows. 

Theorem Al. If the dominance probabilities, p;,, depend only 
on inherent characteristics, and all encounters are equally probable, 
then the directly calculated distribution of structures is the same as 
the stationary distribution. 

Proof: Let ua‘ be the probability of M. in the initial distribu- 
tion which is the same as that directly calculated in I, and ua be the 
distribution after n encounters. We show that wa = wa; hence 
Ua’ = Ua™ for all n. Since the stationary distribution is wa = lim 


nro 
ta™, it follows that wa = Ua. 
Now 


Uu ta) = HT P (Ma, jx), (9) 


where 


P(ma,jx) = Pj, if Majyp=t+1, 
= Prj =1— Pj, if Majyx=—1. 


If the one-step transition Ma —~ Ma is possible, then wa, differs 
from wu. in a single term in (9), namely, 


P (Me j 4g) =1—P (Maj ,,44:) » 
which replaces 
P(Ma,j akg) ? 
_ so that 
1— P(Ma,j 4 kq:) 
Ui uaa Nig toe ees (10) 


P(Ma,j,,%,,) 
The denominator cannot be zero, since this would imply daa =O. 
We have 
sl 
Qa'a Ny P(Ma,j akg) » (11) 


where 1/N is the probability of encounter. Also 


1 
daa = DS Vk (Ma, jx) : (12) 
j<k 
Now 


QN 


Ua Sat Apa 
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Using (10), (11), (12) we obtain 


Sater (one | 


Ug” Wan 


> P (me, jx) + 


Ug —— 
j<k a’=1 
Ug”? Ug”? 
=e > P (ma, jx) + -S (11 — P (ma, jx) ] 
j<k j<k 
— Weer. 


so that the initial distribution is unchanged, and by the remarks 
above it must be the same as the stationary distribution. 
From now on we use the convention that the pair ja’,ka are re- 
lated as follows: 
ja: > ka in Ma, OF Maj « —=T+1; 
(13) 


Kea > ja’ in Ma, OY Maj » =—1. 
fo Rare ts 


We now prove the theorem of Section 3. 
eA Theorem A2. The stationary distribution is independent of uni- 
form bias against reversal, «. 
Proof: The transition probabilities are 


Qaa’ —Cj; & De ij ,; 
a! har a’ a’ 


dag —O0, for B ¥a,a’, (14) 


2N 
Q@aa—1—> ag - 
B=1 
B4a 
We indicate the dependence of the probabilities on e by writing 
dag(e), etc. Then, since the probability of any reversal is 


Duj (€) = (1 — €) px; (0), (15) 
we have 
dag(e) = (1— «) dag (0), for B¥a. 
Since we assume that a stationary distribution exists, 

QNn 

Ua(e) = S ug(e) Aga(e) , (16) 
p= 

Qn 
Ea 


a=1 
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Using (14) and (15) 
Ua(e) =Ua(e) [1 — (1—«) > acg(0)] 


Bea 
+ (1—e) 5 ug(e)dga(0). 
Bea 
Since 1— e > 0, this gives 
0= = Uae) S dap (0) + Dd ug(e) aga (0) 


Bea Bea 


=— ta(e) [1 — daa(0)] + D up (e) aga(0) , 


Bea 
or 


tta (2) = 3 ug (2) age(0) , 
B=1 


but these are the equations satisfied by wua(0). Hence from the 
uniqueness of the solution of (16) it follows that ua is independent 
of e. 

From this proof we get a result about Markov chains in general. 
If the transition probabilities, acg for a # 8, are each multiplied by 
a constant 2, 0 < 4 < 1, and to each daa is added (1 — A) (1 — Gaa), 
then the stationary distribution remains unchanged. 

_ We now derive Equation (6). 
ee Theorem A8. If 


Pie=4 [1 + w(v;—%)] 
where w is a constant, and 
€j,—1/N for allj andk, (17) 
then 
icin’ | ase 
1— w(n—1)/2 


Proof: Consider first the unbiased case, w = 0, pj, = 1/2, and 
denote the transition probabilities in this case by cag , so that 


E(h) = 


Caqr = 1/2N for the N values of a’, 
Cep—0 for Ba,a, (18) 
Con) aan 

We also note that cag = ¢ga, or the transition probability matrix is 


symmetric. 
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Let ha be the hierarchy index for Ma and M = 12/(n* — n) so 
he M. S (v; — v)*. Then the change in h due to the transition 
M. > My’ is | 

ha:-— ha=M [(v;,,—-1—%)? + (%,, +1 — 6)? 
— (vj, — 8)? — (v%,, —)?] (19) 
= 2M (tn aati 1): 


We need to evaluate the sums: 


2N 
Si (a) = 2 Cap he, 


¥ (20) 
S2(a) => Cap h’s. 
B=1 
Using (18) 
il N 
, — te eae ha ’ 
A een 
then, from (19), 
M N 
Si(a) =ha + Mie > (. = Ubi, ) (21) 


The last summation is the sum of the score differences, vj; — v;%, in 

Ma, over all pairs with 7 > k,ie., S (v; — vx). In this sum, each 
j>k 

v; occurs v; times with a positive sign (once for each member domi- 

nated by 7) and (n — 1 — v;) times with a negative sign (once for 

each member who dominates 7), so that 


y) (esl 
Ze 
obvi ie eas Eh, i 
7 J J 2 M a 


Hence 


2 
Ste) =m + (1-5, ) he (23) 
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S2(a) is similarly evaluated. It involves 


1 
ey ve) = — > 3 (0; — Ve)? 
j>k 2 


j=l k= 


1 n n 
=5 BE (Os — 0)? + (9) (24) 


=n5 ch =p 5S (0) | ape 


so that we obtain 
4 
& = 2M? + 2M 5 gene lige eae 
S2 (a) ( N ) ( N h (25) 


We can now write the transition probabilities for the general 
case, wo #0. Using (5), (18), and (19) we get fora #6, 


Gap = Cag [10 3 hod] (26) 
2Nv 
and, using ») dag — 1 and S,(a), we get 
B=1 
1 @ 
ea—l1+—h |, 27 
Qa 2 | H | ( ) 
3} 
where f.— ——. 
“ih ae ib 


Now if wa is the stationary distribution then 


2N (09) 
=> up Apa pies Cpa cae (p=) | 
ro 2M 
(28) 


Also 
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E(h) =D het 


a=1 


= 3 2 he tp Cpe Bese Oy — | 
ha 
(60) 
pa hea a 1+ 
+3 ro( =) 


1 
= Y = h2 
[Bay + EI | 


(29) 


7) 
2 


hg @ h j 
= a a = ee fe —-= a ° 
Fats 2h Cg E o( OM =| OM 
Using S,(a) and S.(a), these sums can be evaluated. The term in 
E'(h?) cancels out and we finally obtain 


H 


OOS ee 03 (i Dd. 


The theorem on biases which lead to the hierarchy is the follow- 
ing. 
Theorem A4. If 


Dix—1 for v3; —% 22 (30) 
and 


CjKD jk >0 for Vi —=— Vk > (31) 


then the hierarchy is the only stable structure. 

Proof: The proof depends on Equation (19) which shows that 
h can decrease by reversals, from ja > Ka to ka > ja, only when 
UE,, — Vig, < — 1 before reversal; otherwise, h is either unchanged 
or increases. 


Suppose that the structure is hierarchical. Then all the v’s are 
different and if 7 > k then v; > v,. From (30) a reversal of ja: > Ka’ 
to ka > ja: is possible only for Vk,, = Vi,,— 1. However, such a re- 
versal according to (19) leaves h unchanged. For the hierarchy h=1, 
and this is the only dominance structure with this value of h. So 
that a matrix M. corresponding to the hierarchy can change only 


into an Ma which also corresponds to the hierarchy. The hierarchy 
is stable. 
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Now suppose the structure is not the hierarchy. Then # cannot 
decrease because as we have seen from (19) this would require 
Ve, — Vi,, © — i, Whereas (30) makes the probability of such a re- 
versal equal to zero. Also since ejpj, > 0, for vj = v, reversals of 
dominance between individuals of equal score must occur. This means 
h cannot remain constant iiidefinitely, if the structure is not hier- 
archical. Any structure other than the hierarchy is unstable. Since 
h can take on only a finite number of values it must eventually reach 
and maintain the value h = 1. This completes the proof. 

In the language of Markoy chains, using dominance structures 
as our states, the hierarchy is an absorbing state and all other states 
are transient. It is also clear that this is the only bias depending on 
Vv; — v; that leads to the hierarchy as the single stable state; for if 
Dix > 0 for vj = v; — 2. then escape from the hierarchy is possible, 
and it is no longer an absorbing state. 


LIST OF SYMBOLS 


j,k, members of the society. 
j > k, 7 dominates k. 
nm, number of members in the society. 
n(n — 1) 
é 2 = 
E,, the possible structures of the society. 
Gag» probability of transition from E, to Eg. 
V = (v,, V2, °*** » Um), the score structure of the society. 
Vj» j’s score or number of members dominated by j. 


Uq, probability of Z, in stationary distribution. 
n ce 

h=M,> (v;—v)?, hierarchy index. 
Pati 


Nea 


M = 12/(n3 — n). 
v = (n — 1)/2, mean of 2,;. 
E(h), expected value of h. 
9 jx = Pix — Pxj- 
Djx» probability that 7 > Hie 
€;,» probability of an encounter of j andk. 
e, factor giving bias against reversal of dominance. 
w , factor giving effect of v; — vz, On Dj, - 
H=38/(n + 1). 
Cap» transition probability a,g when all p;, = 1/2). 
M,, structure matrices. 
Mq,;;» the element in the jth row and kth column of M,. 
ha, h of E, orM,. 

Qn 


S, (a) Sate Cap hg. 
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Qn 


S, (a) peer hg. 
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Lea’s formula for the determination of size of the sensitive volume 
is corrected and applied to experiments on inactivation of the tobacco 
mosaic virus using three different wave lengths of x-rays. The purely 
geometric concept of the sensitive volume is not sufficient for explain- 
ing the known dependence of the inactivation of this virus on wave 
length. The concept of transfer of energy around the ionizing particles 
is discussed in this connection in a manner similar to that suggested 
by Pollard and Forro for the inactivation of the T — 1 phage, and re- 
sults similar to those given by these authors are obtained from an 
analysis of the inactivation data of J. W. Gowen (tobacco mosaic virus). 
Differences of interpretation of these two sets of experiments carried 
out with different types of radiation are pointed out. 


' It is known that experiments on the effect of radiations on micro- 
organisms can be used to determine the volume of the region which 
must absorb an amount of administered energy in order to produce 
the observed effect. This volume is the so-called sensitive volume and 
is situated, in general, inside the microorganism. The possibility ex- 
ists, however, that energy originally absorbed in a medium surround- 
ing the microorganism is conveyed to the latter by various energy 
transfer processes, so that the size of the sensitive volume involved 
in a radiation experiment may be larger than the size of the whole 
microorganism. This is the reason the sensitive volume primarily has 
a physical meaning. The size of the biological entity responsible for 
the effect observed may be smaller or larger than that volume, since 
transfer of energy in and outside of the microorganism is possible 
in all directions. The details of the transfer of energy in this kind 
of process are not well known; consequently, no definite rela- 
tions between the two kinds of volumes can be established. If the 
effect of radiation is primarily due to a chemical reaction, the bio- 
logical or biochemical sensitive volume may, perhaps, appear in an 
ultimate analysis as some kind of spheroid whose length is that of a 


single chemical bond. 
263 
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When calculating the sensitive volume from the relationship be- 
tween the frequency of microorganisms showing a given effect and 
the administered amount of energy we must take into account the 
density of ionization produced by the type of radiation used. D. E. 
Lea (1947, pp. 83-7, 353-54) has given an approximate formula 
which does take this effect into account. It is the purpose of the pres- 
ent note to derive an exact formula and to apply it to a set of experi- 
ments on inactivation of the tobacco mosaic virus. 

Lea assumes that, if one ion or one cluster of ions occurs in the 
sensitive volume, this is sufficient to produce the observed effect. The 
theory leading to an exact expression of his formula can be presented 
in the following manner: The sensitive volume is assumed to be a 
sphere of radius rv. The probability of occurrence of at least one ion 
(or of at least one ion cluster) within a length x of a track of an 
ionizing particle is assumed to be, according to Poisson distribution, 
equal to: 

eee 


where L is the mean distance between two consecutive ions. This 
formula is, of course, an approximation and does not express the 
well-known fact that the density of ionization increases as the par- 
ticle is slowed down in its passage through the matter. The ionizing 
tracks are not assumed to end within the sensitive volume. The prob- 
ability of entry of an ionizing particle into, or of its exit out of, the 
sphere through an infinitesimal surface area is assumed to be pro- 
portional to the size of the latter independently of its position. Let 
us consider the point of entry of the particle into the sphere and the 
angle between the diameter passing through that point and the track 
of the particle. The probability that this angle is between ¢ and 
¢ + dp is sin 2¢d¢. The length of the corresponding track within 
the sphere is between 2rcos ¢ and 2rdcos¢ . Consequently the prob- 
ability that a particle entering the sphere under an angle of ¢ leaves 
at least one ion in it is 


1 — exp [— (2rcos ¢)/L]. 


The probability that a particle entering the sphere under any angle 
lonizes it at least once is 


7/2 
o(é)= { (1—e€ > 9) sin 24 de 


=1+4 264 ef 2¢2(1—e), (1) 


where = 2r/L. A similar relation shows that the mean length of 
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a track within the sphere is 47/3. Therefore the mean number of 
ions per track in the sphere is 47/3L. If the administered dose is 
measured as the mean number D of ions per unit of volume, the 
number of tracks crossing the sphere as a result of a dose D can be 
taken as: 


where v is the volume of the sphere. Therefore the probability P of 
ionizing the volume v at least once with a dose D is: 


ee ee (1b ) P0028": (2) 


Lea’s formula is the first order term of equation (2) expanded in 
power series of ®. Since 


’ 2; 
lim #(£) (4s)=1 
&>0 3 


we have P > 1 — ee’ as E > 0, which is intuitive since € = 0 corre- 
sponds to L= o . Similarly, P=0 forL=—0 and P=1forD—o. 
It is seen from equation (2) that In(1 — P) is a linear function of 
the dose D, a property well known in elementary theory where the 


° 1 2 3 4 5 6 7 8 9 19 


Figure 1. The function y(é) for § from 0 to 10. 
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0.2 0.6 | 1.4 1.8 
Figure 2. A semilogarithmic plot of the function ¥() for ¢ from 0.2 to 2. 
For  < 0.2 the effect of ionization density is negligible. 


effect of ionization density is neglected (L = o). Formula (2) can 
be written also in the form: 


In(l—P)s—< Ye) LAD, (3) 
where ¥(&) =— (a/4) & In(1 — ®). Graphs of the function ¥(&) 
are given in Figures 1 and 2. For small values of &, i.e., for sensitive 
volumes small with respect to the mean distance of two consecutive 
ions, we have 


P (€)L'=v(1— €/24 + ----) (4) 
which corresponds to: 
W(E) = (2/6) &(1— €/24+----). (5) 


In these formulae §/24 represents the correction term due to the 
ionization density. The corresponding term is —3 &/8 in Lea’s for- 
mula (loc cit., p. 86). For € < 0.2 we have 1 — &/24 = 1 with an 
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error smaller than 2 per cent, and because of formulae (8) and (4) 
PY (&)L* = v, n(1 — P) = — vD, which means that within this range 
of & the effect of density of ionization is completely negligible. This 
is intuitive since smaller values of € mean that the distance between 
the ions is large with respect to the diameter of the sensitive volume. 
For doses of D which are sufficiently small, P is small with respect 
to unity and we can write, instead of formula (3), 


P= W(é)L'D. (6) 

The formula given by Lea is (loc. cit., p. 353): 
P= (n/6)&L°D/F , (7) 
where he has tabulated F = 2&/(3@). For €=1 and é= 10 formula 
(6) gives P ~ 0.5 L°D and P = 307 L®D respectively, whereas Lea’s 


formula (7) gives P = 0.37 L'D and P= 77 L?D. Thus the differ- 
ence between the two formulae is substantial even for very small 
doses. The difference is more pronounced with high ionization den- 
sities, since then L is small and é is large. The proportionality be- 
tween P and D involved in Lea’s formula holds theoretically only for 
an infinitesimal dose D . 

The determination of the sensitive volume from experimental data 
can be carried out in the following manner: The value of LZ is known 
from the type of radiation used. Then a semilogarithmic plot of 1 — P 
against D, using formula (3), gives us the value of WY. From Fig- 
ure 1 or 2 the corresponding value of € = 2r/L is obtained, which 
gives the diameter 2r of the sensitive volume. This procedure was 
applied to a set of J: W. Gowen’s (1940) experimental data on in- 
activation of the ordinary strain of the tobacco mosaic virus by three 
different wave lengths of x-rays. A semilogarithmic plot of the num- 
ber of surviving viruses is linear, as required by equation (3), ex- 
cept at very high doses. This deviation from linearity may be due 
to the fact that at high doses a substantial fraction of viruses under- 
goes non-lethal mutations which are not taken into account in the 
above plots. Consequently the theory was applied to the linear part 
of the diagrams which cover about 500,000 roentgen for 4 = 1.5 and 
2.1 A°,and almost the full range of the dose for 4 = 0.7 A’, since the 
deviation from linearity in this latter case is quite small. The con- 
version of the dose in roentgen to a dose expressed as a number of 
ions per cubic micron has been made on the assumption of 1.5 ions 
per u® according to the calculations of Lea (loc. cit., p. 8) for virus 
protein. The average energy of the electrons ejected by the photons 
of 0.7, 1.5, 2.1 A° has been assumed to be respectively 14.2, 7.7, 5.7 
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KeV (cf. Lea loc cit., p. 12). Accordingly the mean distances L be- 
tween the ions have been taken to be 446, 265, 202 A° (cf. Lea, p. 
25). This gives us the following result: 


rA° x é 2r in 10-22 
0.7 0.12 0.62 2.76 
1.5 0.15 0.68 1.83 
2.1 0.17 0.69 1.39 


Although not very large, there is a systematic deviation from the 
theory since the latter requires a constant 27. Possible reasons for 
this deviation are: (a) too crude a calculation of the mean distance 
L between the ions, and (b) the fact that the wave lengths upon 
which the calculations are based are only average wave lengths. The 
average size of the sensitive volume as given by Gowen (loc. cit.) 
corresponds to 2r = 2.43 X 10?u. This magnitude does not take the 
ionization density into account, whose effect with the present wave 
length of x-rays is small in any event. 

If we disregard the effect of ionization density, that is, if we 
calculate v from the formula 


P=1—exp (—vD), 


we obtain practically the same v as if we take this effect into account. 
Independent of any physical interpretation of the coefficient v in the 
above formula for P, it is clear that v is a measure of the inactivating 
efficiency of the radiation, since a small v corresponds to a small P 
for the same dose D . The quantity v is proportional to the cube of the 
calculated diameters. Therefore, in the experiments discussed, the 
inactivating efficiency of the radiation decreases very strongly with 
the increase of the wave length. If the assumptions of the present 
calculations are correct, this change in efficiency cannot be explained 
by the purely geometrical fact that the ions are closer to each other 
when the wave length is longer, so that in the latter case the prob- 
ability is higher that within the sensitive volume more ions will oc- 
cur than are required for inactivation. There is, of course, the pos- 
sibility, which we do not analyze here, that when the ions are closer 
to each other, there is more likelihood for the products of ionizations 
to recombine with each other and to restore the irradiated molecules 
to their original form in this way. It is conceivable that such recom- 
bination phenomenon may decrease the efficiency of radiation with 
the increase of the wave length. 

We close this note by pointing out that, if absorption of a photon 
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21 al sat 0.7 at 


Figure 3. Linear relation between the calculated apparent diameter of the 
sensitive volume and the energy of the photon. Inactivation data of Gowen 
(loc. cit.) on the normal strain of the tobacco mosaic virus. 7 indicates a possible 
true diameter of the sensitive volume, according to a theoretical interpretation 
of the diagram. 


rather than of a single ionization is considered as the event leading 
to the inactivation of the virus, a linear relation between the diameter 
of the sensitive volume v, and the energy of the photon is obtained. 
Figure 3 represents such a linear relation obtained from Gowen’s 
data for the tobacco mosaic virus. The size of v, has been calculated 
by means of the formula 


1— P=exp (—,D,), (3) 


where D, is the dose measured as the average number of photons 
per cubic micron. The conversion to D, of the dose measured as num- 
ber D; of ions per cubic micron has been carried out by means of the 
formula D; = hyD,/e, where hy is the energy of the photon and « 
the average energy spent per ion. The value of 32.5 eV which has 
been taken for « is the currently used average energy per ion in the 
case of air (Lea, loc. cit.). The use of this value of e for virus is fur- 
ther justified by the fact that 44 different sets of measurements on 
29 gases and 1 liquid (Compton and Allison, 1946, p. 478; Mohler 
and Taylor, 1934; Thomson, 1933, p. 238) gave a value of ¢ from 0.77 
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to 1.54 times smaller than in air with a mean of 1.2; the value of ¢ 
found for air in most of these sets of experiments was 36.4 eV (meas- 
urements of Gaertner, cf. Compton and Allison, loc. cit.). 

A linear relation between the calculated diameter of the sensi- 
tive volume and the energy delivered by the primary particle per 
unit length of its track has been observed by E. C. Pollard and F. 
Forro, Jr. (1949). These authors explain this fact by assuming that 
the energy delivered by a particle along its track is transferred ra- 
dially from it at such a rate that the energy available in the medium 
per unit volume is inversely proportional to the distance from the 
particle track. Pollard and Forro suggest that a certain amount of 
this energy must be absorbed by the sensitive volume of the micro- 
organism to produce its inactivation. This energy has, according to 
their experiments, the character of an activation energy of a chemi- 
cal reaction. It equals 3.2 eV for the 7 — 1 phage and has been known 
to be 1.5 eV for mutations of Drosophila (Timoféeff-Ressovsky, 
1937). Pollard and Forro associate with each particle a “volume of 
action” which is characterized by the fact that sufficient energy is 
available for inactivation inside but not outside of its boundaries. 
This volume of action, which could also be called an apparent sensi- 
tive volume, is fundamentally a purely physical concept. It is this 
volume which, according to Pollard and Forro, is directly calculable 
from relation (8). 


On the basis of these concepts Pollard and Forro reach the con- 
clusion that the diameter of the true sensitive volume can be obtained 
from the above-mentioned linear relation as the apparent diameter 
which corresponds to a zero energy of the particle. They find in this 
way a diameter of the true sensitive volume of 2.8 <X 10-u for the 
T — 1 phage. The diagram of Figure 1 would give a true diameter 
of 1.3 X 10° for the tobacco mosaic virus. J. W. Gowen (loc. cit.) 
has also given inactivation data for the Aucuba and an ADO variant 
of the tobacco mosaic virus. If these are analyzed in a similar man- 
ner as has been done for the normal strain of the tobacco mosaic 
virus, a somewhat smaller, but still quite good, linear relation be- 
tween the calculated apparent diameters and the energy of the photon 
is obtained. The true diameters obtained by this analysis are found 
to be about twice as large as for the normal strain of the tobacco 
mosaic virus, bringing the average size of the true sensitive volume 


of these three variants very close to that found by Pollard and Forro 
for the J — 1 phage. 


Perhaps the exact statistical relation between the apparent and 
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the true sensitive volumes should be investigated more carefully than 
it has been up to now. Although the conditions of the two sets 
of experiments are very different from each other, since Pollard and 
Forro used heavy particles of few MeV and in Gowen’s experiments 
the ionizing particles were mainly photoelectrons of some 5 to 15 
KeV, the linear relationship noted in both cases suggests possible 
similarities of the fundamental physico-chemical mechanisms in both 
cases. In evaluating this possibility one should keep in mind the fact 
that Pollard and Forro found a linear relation between the diameters 
of the sensitive volumes and the dose delivered per unit length of 
track of the primary particle, whereas the present analysis indicates 
a linear relationship between the diameters and the energy absorbed 
per photon. If the basic mechanisms of inactivation are similar in 
both cases, that difference may be due, perhaps, to purely physical 
differences in the manner in which the energy of x-rays and deuterons 
is absorbed by the irradiated medium. 

This work was aided by a grant from the Dr. Wallace C. and 
Clara A. Abbott Memorial Fund of The University of Chicago. 
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The relative importance of diffusion and recombination of the prod- 
ucts of ionizing radiations is examined in several typical eases and 
methods for the solution of the differential equation of columnar ioni- 
zation are discussed. It is shown that Jaffé’s method is not applicable 
to radiobiological or purely radiochemical processes in which no ex- 
ternal electric fields are acting. A method based on a recent sugges- 
tion of Kramers (see Gerritsen, 1948) appears to be particularly good 
when recombination is strong. 


1. The background and the objectives of the paper. The fact 
that the amount of energy needed to produce a given biological effect 
depends on the energy of the ionizing particle has been noted in many 
cases (cf. Lea, 1946; Timoféeff-Ressovsky and Zimmer, 1947). The 
type of experiment in which the efficiency of the radiation is meas- 
ured as the number of microorganisms showing a certain effect per 
unit amount of administered energy is particularly appropriate for 
quantitative discussions. Theories have been suggested to explain this 
so-called wave length dependence of the biological effects of radia- 
tions. This can show two opposite types of relation: the biological 
efficiency of a radiation may increase or decrease with the increase 
of density of ionization. We are concerned here with the latter type 
of effect (for examples of the former see, e.g., Failla, 1949; Zirkle, 
1935). Among the earliest theories suggested to explain this wave 
length effect is one based on the concept of sensitive volume in which 
an ionization should occur to initiate a biological process. According 
to this theory an increase of the wave length of radiation may de- 
crease its efficiency, because with an increase of the wave length the 
density of ionization increases and, consequently, there is more 
chance that if a sensitive volume is ionized at all it is ionized more 
than once. Some experimental results are in agreement with this 
theory (cf. Lea, 1946). However, examples are known (see, eg., 
Opatowski, 1951) in which the effect of wave length as calculated by 
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the theory mentioned above appears completely negligible, whereas 
the experiment shows a very marked magnitude of this effect. Two 
theories can be considered to explain this and other disagreements. 
One may be based on a recent suggestion of E. C. Pollard and F. 
Forro (1949) (cf. Opatowski, 1951) which combines the concept of 
activation energy of a biological process with that of transport of 
energy delivered by the primary particle to the medium surrounding 
the sensitive volume. This leads to the concept of a sphere or volume 
of action of the primary particle which depends on the energy of the 
latter and is, in general, larger than the biological sensitive volume. 
The other theory considers a chemical or ionic recombination of the 
direct or indirect products of the ionizing radiation (cf. Lea, 1946, 
1947). This recombination may concern the positive and negative 
ions produced by the radiation or some groups of atoms produced by 
the ions. With densely ionizing radiations the ions and their products 
are closer to each other, so that the chance for their recombination 
is higher which would decrease the efficiency of a densely ionizing ra- 
diation. There are several complications which make a quantitative 
evaluation of this latter theory difficult. First of all, with the excep- 
tion of some simple radiochemical reactions, the exact nature of the 
processes induced by radiation is not known. This lack of sufficiently 
accurate quantitative knowledge unfortunately pertains to water also, 
an important substance from a biological viewpoint (a discussion of 
this subject with extensive bibliographical references can be found 
in Toulis, 1950). Second, the process of recombination is superim- 
posed upon a concomitant process of diffusion. Thus their quantita- 
tive aspects have to be studied on the basis of the theory of diffusion 
with the addition of the kinetics of recombination, which introduces 
a nonlinear term into the differential equation of diffusion. 


Notwithstanding the phenomenological uncertainties of the field 
and its theoretical difficulties, attempts have been made to apply the 
theory last mentioned to concrete questions of importance for an 
understanding of some radiobiological processes (see Lea, 1947). The 
mathematics used therein was based on a semi-empirical method of 
Jaffé (1913) for the solution of the diffusion-recombination equation. 
This method has been applied several times since its first publica- 
tion, but mainly for gases and only in the presence of external electric 
fields. It is shown in the present paper that in the absence of these 
fields Jaffé’s method leads to an inacceptable conclusion. Recently 
A. N Gerritsen (1948) noted that an application of Jaffé’s method 
for liquids, even in the presence of external electric fields, may lead 
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to errors. H. A. Kramers suggested a new method to replace Jaffé’s. 
A perturbation method was given by H. G. Landau (1950). These 
various methods, together with two new ones, are examined in the 
present paper with particular reference to phenomena in liquids in 
view of their importance in biology. 

The mathematical nature of the problem can be presented in 
its simplest possible form by considering the particular case in which 
the recombining ions or groups of atoms have the same diffusion co- 
efficient D. Then, if n is their concentration and t the time, we have: 


on/ot = diverad n — an’, (1) 


where a is a constant characterizing the time rate of recombination. 
Jaffé assumed that a short time after the passage of the ionizing par- 
ticle the ions or the groups of atoms formed by the latter were dis- 
tributed at random around the track of the primary particle. Accord- 
ingly, he gives 


n(7r, 0) =N. (a2 b?)- exp (—?r?/b’) (2) 


as the initial condition for the function n(7, t), where 7 is the dis- 
tance from the primary track and N, and b are two constants. If n 
is measured as the number of ions (or groups of atoms) of each kind 
per unit volume and / is the length of the primary track, Nol is the 
total number of ions of each kind at t = 0. Their mean distance 
from the axis of the primary track is b\/n/2. Expression (2) has 
to be interpreted as an average concentration along the track, since 
m is actually smaller at the beginning than at the end of the track. 


2. The relative importance of diffusion and recombination. 
Since equations (1) and (2) cannot be solved in finite terms, it is 
important to estimate the relative order of magnitude of the two 
terms on the right-hand side of equation (1) before choosing an ap- 
propriate approximate method of solution. It appears from Gerrit- 
sen’s work that for a-particles in liquids the recombination term an’ 
predominates over the diffusion term D divgrad n. But Gerritsen 
experimented at very low temperatures (liquid N,H,He,A) ; at room 
temperature the diffusion coefficient is likely to be much higher and 
no direct conclusion, for this temperature, can be drawn from his 
work. However, at t = 0 we have: 


. 2 2 
= Pores 2G =1 )exoy 3) 


K =e2s b? b? 2 


an? a No 
which expresses the relative contribution of diffusion and of recom- 
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bination processes to the change of concentration at r. It is seen that 
diffusion initially decreases the local concentration within r < Db, 
and thus cooperates with the recombination process; for r > b the 
opposite occurs. Both regions are important since at t = 0 the cylin- 
der r < b contains only 1 — e! ~ 0.63 of the total number of ions. 
For brevity we shall refer to the diffusing entities as ions, although 
they may consist of neutral groups of atoms. Across the cylinder 
r = b no diffusion occurs at t = 0. The total number of ions which 
at t = 0 are outside of the cylinder r = b is Ny exp(—?/b?). 


pif 
K/I 
40 
1.5 
30 
| 
20 
0.5 
10 
Ae 
ee iaisen meh r/ber 


eves I 2 
FIGURE 1. The relative ieponeanee of the diffusion with respect to the re- 
combination process immediately after the passage of an ionizing particle, at 
various distances from its track. Ordinate — |K| or K in units of I = 47D/(aN,). 


Abscissa = r in units of b or r2 in units of D2. 


Figure 1 gives two graphs of the function K . This function is 
practically constant within r < b/2 and changes very rapidly be- 
yond that, tending to 0 as r > o. Thus sufficiently far from the 
primary track the concentration of ions is so small that their chance 
for recombination is practically zero. 

The relative importance of diffusion with respect to recombina- 
tion is initially proportional to 


I=¢/N., where ¢=4aD/a. (4) 


Various theoretical expressions for the quantity ¢ can be given. 

We may use the relation D = kTu (valid for perfect gases and 
based on Fick’s law of diffusion), where wu is the mobility (velocity 
acquired under the action of a unit force), & Boltzmann’s constant, 
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and T the absolute temperature. We may consider together with th 
above Langevin’s relation between recombination rate of ions afd 
their mobilities u., wu. (see, e.g., Cobine, 1941, p. 99): 


a= 4ne7(u, + u_)/e, 


where e is the ionic charge and «¢ is the dielectric constant. Then the 
following formula is obtained for ¢ with the apenas D of equal 
mobilities of both ions: 


$=kT e/(2e2) = 300T «. (5) 


This formula would indicate a much higher importance of diffusion 
in water than in gases, because of the high dielectric constant of 
water. It would also indicate that the importance of diffusion with 
respect to recombination increases in direct proportion to the abso- 
lute temperature. Both these facts suggest that Gerritsen’s conclu- 
sions derived at very low temperatures for liquid gases, for which 
é = 1, may not apply to water at room temperature. In fact, in this 
latter case ¢ would be at least between 10* and 10* times larger than 
in his experiments. In addition to this, formula (5) concerns a purely 
tonic type of recombination. 

For processes between neutral groups of atoms the classical 
Arrhenius equation or the quantum theory of rate processes (Glass- 
tone, Laidler and Eyring, 1941) or other theories of diffusion (Bar- 
rer, 1941) and reaction rates (Moelwyn-Hughes, 1940, p. 537) lead 
to an expression of the type A exp(—E/kT) for both D and a, where 
E is an activation energy and A a constant which may depend on T. 
In the simplest case in which A is for both processes either inde- 
pendent of 7 or dependent on it in the same manner, the factor ¢ 
and, consequently, the relative importance of diffusion with respect 
to recombination, would decrease with increasing temperature if the 
activation energy for recombination is higher than for diffusion. This 
is an opposite type of relation than that expressed by equation (5) for 
purely ionic recombinations. It may be expected to occur frequently 
since activation energies for diffusion in liquids are relatively small. 

The dependence of diffusion and recombination on the type of 
radiation used is expressed in K through the density of ionization 
N,. The lower this density, the more important is diffusion with re- 
spect to recombination. From a theoretical viewpoint one would ex- 
pect that b also depends on the type of radiation since it is propor- 
tional to the mean distance of ions from the primary track. Higher 
energy particles produce 6-rays of longer range; consequently, one 
would expect that b should increase with increasing energy of the 
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primary particle. Similarly one would expect that b would be larger 
for X- and y-rays than for heavily ionizing particles. However, ex- 
perimental data on this subject are not available. 

Table I gives some numerical information on the relative im- 
portance of the diffusion and recombination processes a very short 
time after the passage of the particle or photon. It is seen that the 
effect of recombination is small in air, but not in water and tissue 
in the case of heavily ionizing particles. It is noteworthy that none 
of the conclusions of Table I depend on Db, that is, on the radial dis- 
tribution of ions. 

3. Methods of successive approximations for the solution of 
the equations. We now proceed to a discussion of approximate meth- 
ods of solution of equations (1) and (2). To simplify the formalism 
we introduce the following symbols: 

A=4)/07.— B=7077No I —0/ (AB) Ale 
R=(7b)2 71 — Age i ele ee (6) 


Equations (1) and (2) then become, using the assumption of a cylin- 
drical symmetry of 7, 


Ot eal res : 
f(R ,0) =exp(—R). (3) 


Or, in the equivalent integral form, 


sf iee af 
f= [ featencd ap | at + exp(—R). (9) 


If recombination predominates over diffusion we can take 
fn =1/(oT + exp R) 


as a first approximation to f. The function f = f,, is the solution of 
equations (7) and (8) when D = 0. The next approximation f = f, 
is obtained by substituting f = f,, in the right-hand side ee equation 
(9). This gives us: 


fea=fri(l—T + TRe®f,.). (10) 


This process can be continued and the integrations involved in 
any of the successive approximations can be expressed in finite terms. 
This is essentially a method suggested by Kramers. 
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If diffusion predominates over recombination one can proceed 
in a similar manner. As the first approximation to the function f 
one can take : 


fo: = M exp (—R/M), 


which is the exact solution of the system (7)—(8) in the case of zero 
recombination. Then the next approximation follows from equation 
(9): 

foe = fox a GF (2k,M) , 
where Ff, (2R,M) = (e?*/* — e**) /2R. The function F> is one of the 
following set of functions that we will encounter later: 


aM 
F(x ,M) = | we*/" du= ME, (x/M) — E,, (a), 


1 


vl 
E,(x“) = | rer EG. 


0) 


Tables of G. Placzek (1946) are available for the function H, and 
from the Mathematical Tables Project (1940) for the exponential 
integral E,(x) = — Hi(-~). 

An intermediate approximation is obtained if one starts from a 
suitable combination of fp, and f,, as a first approximation to f. For 
instance, one can take as such a first approximation: 


fa= A—A)fot hfn, (11) 
which satisfies equation (8) independently of h. If 
h=[1+ (R—1)97e7 17, 


equation (7) is also satisfied at 7 = 0. Equation (9) then gives the 
next approximation to f: 


fd = PEs 8) + ; (1—e*fa) (Re*fa—1) 
1 R 
+ (Rh +) Sgr eer kw | 
g g 
+ 2Rh'[F,(R, M) =e (e® —fn)] 
h? 
—9| = hyFo2R.M) + (ctf) 


: eF + —-R 
+ 2h(L— hy perPeP( ARe®, om) aed 
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where p = 1/(e* —g), h' =dh/dk, h* = d?h/dk*. 


4, A numerical discussion of the successive approximation 
methods. Two successive approximations to f are good if they do not 
differ very much from each other [cf. equation (9) ]. Let us consider 
ions around the primary track (R = 0). Since 


{2(0,T)/faW, T) =—1i—T, 


fr and f,. are good approximations to f for sufficiently small 7 . Since 
t = 7T/D, small T means longer time if diffusion is smaller. Consider, 
for instance, the time that would be required to reduce the concen- 
tration at R = 0 to 10% of its initial value if diffusion were com- 
pletely negligible. It follows from the expression of f,, that this time 
would be in dimensionless units 7 = 9/g. Among the examples con- 
sidered in Table I the one in which the effect of diffusion is the small- 
est is that concerning H, OH, and 1 MeV a-particles for which 
I =1/g = 0.03. This would give us T = 0.27. Thus within the time 
considered the error with which f,.(0, 7) may be affected can be 
estimated as being smaller than 27%. According to a suggestion of 
Lea that b = 10-* em, we find that 7 = 0.27 corresponds to ¢ = 1.35 X 
10° sec. For intervals of time twice as large, f.(0, T) < 0, the 
method fails. The time t¢ that enters into the present theory is not 
counted from the moment of passage of the particle, but from the 
moment at which the ions produced by the particle have acquired 
distribution (2) and have lost so much of their kinetic energy that 
their motion is subject to the laws of diffusion. It is difficult to esti- 
mate this moment in an accurate manner. Landau (1950) shows that 
if the energy of the primary particle is absorbed instantaneously it 
takes b°/4D sec for the ions to acquire distribution (2) under the 
assumption that recombination has not acted during this time. With 
the «values of constants used here that time would be 5 X 10~° sec. 
Perhaps a more informative comparison is obtained if, instead of 
local concentrations, the total number of ions (or atom groups) is 
considered. This number, expressed as a fraction of the total num- 
ber initially present, is: 


N(t)/No= ihe F(R, T)dR= [In(1 + gT)]/(9T) 


in both first and second approximations, i.e., using fr or fe for f. 
This is to some extent intuitive, since diffusion, which the method 
treats as a second order effect, changes the distribution of the ions 
but not their total number. In the numerical example just consid- 
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ered when the concentration at the primary track is somewhere be- 
tween ~ 0.075 and 0.1 of its initial value, the total number of ions is 
reduced to ~ 0.26 of their initial number, according to the above for- 
mula, independent of the value of b , whose only effect is-to fix the 
scale of length and of time. In the third approximation we have from 
equation (9) 


co D spel 
N(t)/No= if fisdR=1—g { dR | fe2dT , 


because Rof,./dR is zero for both R = 0 and R = ow. Taking equa- 
tion (10) into account we have: 


a ea} oT ew if (era) 
| aR | fodT — | (1—T)2dT | 42 dR 


0 


a fa ee) 
+2 { T(1—T)aT | Re®f,*dR 


me DD Hh 
+ [ Reetar | Tf,'dT. 
/J 0 0 


If we put gT = 7 and use the substitution 7 + e* =u, we find: 


oo “R ef dR 1 In(1 ain tT) bi 
2 a <3 — __ 2 
f fatdR=2 | ef | ; pe 


~ sere Ree'dR ie i (nw) * 

oR of 4 ——— a Steeceet 

f Re*dR if Tf,‘ d 2 j (gT + e®)* Sat (Ww a r)* 
i 


wi od =) 
oA Or wel 


where w —e*, 


a oe —2in(t 4) Inw, 
Wr T 


L being a tabulated function (see Fletcher, Miller and Rosenhead, 
1946, pp. 343-44; Powell, 1943; Fletcher, 1944) defined by 


2 bales 
L(x) = [{ vk 


——— 
S 


1 


Taking into account that (Powell, loc. cit.) 
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lim [L(a) — }(Inz)2] => 


foo 


we find Jie0 = (22/3) + (In r)?. From this, using the relation (Pow- 
ell, loc. cit.) 


2L( 1 + =)== (in a= Ui e 
o 
we obtain: 
fo) 7 
il Ree"dR | Tf,,4dT =T[L(1 + 1) —In(1 + 7))/(392)- 


Finally, for the total number of ions in the third approximation we 
have: 


NCE) mel (Ais ea T 
= + rr) — T)]. 12 
= - as 3 (La + Nes Nabe ay pe Wey 


The first term on the right-hand side of this equation is identical 
with the expression of N(t)/N, in the case of pure recombination. 
Thus the term containing 1/g represents the effect of diffusion. 
Through approximate calculations based on Lea’s estimates we 
found that the total number of H and OH produced in water by a 
1 MeV a-particle would be reduced to 26% of their initial number 
within ¢ ~ 1.85 X 10° sec if diffusion were absent. Carrying out 
calculations with formula (12), which takes diffusion into account, 
we find that fraction to be ~ 30.5%. (In evaluating the meaning of 
these figures one should keep in mind that ¢ = 0 corresponds to a time 
which is somewhat later than the moment of passage of the particle.) 
For large T’s formula (12) fails because it gives N(t)/No < 0. It 
is not difficult to derive a third approximation formula also for the 
concentration at R = 0. From equation (9) we have: 


fT 


0 fre 
r3 A yew & =f. ——— mise | 
fs(0,T) +f ES of | a 


=-_|4 
Ls. 


2 In(iee | : 


Carrying out calculations with the data just considered (1 MeV 
a-particle in water), one finds the following: Recombination alone 
would reduce the concentration at the primary track at T = 0.27 to 
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10% of its initial magnitude; taking diffusion into account would give 
a concentration of 7.3% by the first iteration formula and 7.55% by 
the next one, just derived for f,;. The closeness of these two last fig- 
ures speaks well for a good convergence of the method. The relative- 
ly small contribution of the diffusion process to the reduction of con- 
centration at the primary track is in agreement with the results of 
Table I. 

If recombination is small with respect to diffusion we have at the 
primary track: 


foo (0, T) =M# — g(1— MM): 


A first limitation to the applicability of this formula is that fp.(0 , 7) 
be > 0, which requires T < 1/g. This also appears from the ratio: 


PhO! )/to.(0, 7) — 1 gf. (13) 


In the case of a complete absence of recombination, the concentra- 
tion at Rk = 0 is reduced to 10% of its initial value at T = 9 and to 
1% at T = 99. This can be easily derived from the expression of 
foi(0, T). For 2 MeV y-rays, using Jaffé’s experimental data for air, 
these values of J correspond to gT = T/I = T/3683 = 2.4 X 10-° and 
2.7 X 10 respectively. Thus in this case the method would seem very 
good, as far as the concentration around the primary track is con- 
cerned. Its disadvantage is that the second order approximation does 
not reveal any reduction in the total number of ions, and the third 
order approximation is difficult to calculate. The previous method 
would give N(t)/N. = 98% in the present case using formula (12) 
for T =9. Thus with 2 MeV y-rays in air diffusion is so strong that 
ions at the primary track are reduced to a small fraction of their 
initial number therein during a time when ionic recombination has 
almost not yet taken place. For T = 99 formula (12) fails because 
it gives N(t) < 0; the method over-emphasizes recombination. 

The disadvantage of the intermediate method is that the calcu- 
lation of the total number of ions requires involved numerical inte- 
grations. AtR=0, 


fal0s 2y=—17[(1 + T) (1 + of)];, 


gT 
TP ol. 


2 1— 
~_+ | ge +f in(o + ))- 
GQ=—@)2its 2 - tof 19 


fie(0, T) =fii(0,T) ate 
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Thus at the primary track this method represents. the concentration 
as the product of the concentrations that would exist there if diffu- 
sion and recombination were acting separately. 

5. Comparison with other known methods. Jaffe (1913) gave 
a semi-empirical solution of equations (7) and (8) which is: 

_ 1 exp(—R/M) 
. f M 1+ (g/2)mM~ 

The use of this solution is equivalent to changing g in equation (7) 
to (g/2)exp(R/M). Thus at R= 0 only 50% of the effect of recom- 
bination is taken into account. Since diffusion is taken into account 
to its full extent, the error is more pronounced when recombination 
is strong, which is in agreement with Gerritsen’s observations. How- 
ever, even when recombination is small with respect to diffusion 
Jaffé’s method does not yield the correct order of magnitude of the 
total number of ions. One can see this by considering Jaffé’s formula: 


N (t)/No= [1 + (g/2)In MJ]. 


Keeping in mind that g = 1/I, the time necessary to reduce the total 
number of ions to 10% of their initial number is, according to the 
above formula, equal to T = exp (187) —1. Using Jaffé’s experi- 
mental data for air this time becomes in absolute units in the case 
of 1 MeV a-particles (see Table 1; b= 1.79 X 10° em, A = 4.62 X 10¢ 
sec!) : 


t= [exp(18 X 4.46) —1]/A =5 X 10” years ! 


As high as this time is, the formula would make it still higher for X- 
and y-rays. This shows how misleading an application of Jaffé’s the- 
ory may be when no electric fields are present. (In all experiments 
in which Jaffé’s theory was used, ions were subject to an external 
electrostatic field, and no such serious contradictions with the theory 
have been noted.) 


The method of Landay (1950) amounts to expanding f in power 
series of g. At R=0 his solution is: 


f(0,T) =S gn (0,7) 


m=0 


where f,(0,7) =1/M , 
f,(0, T).=— [In(2M —1)]/(2M), 
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| 2 1 
2Mf.(0, T S10 ea ious 
) 38M Ms M+o 


1 M+o-—1 ei | 
—Le-il— +L ———— }) + ——— 
( 7.) ( o—M ) L( o+ M ) 


—L(5,—8)—m3-mM—Lays) + (n/2)?, 


with co = \/ M*? — (M/2) and L the function that we have already 
encountered. (In using Powell’s tables for it the formula 


L(-y) =Iny-In(1+ y) —L(1+ y) — (27/6) 


is helpful.) This method requires that either g or T, or both, be suf- 
ficiently small; otherwise /, + gf, would be < 0. The method has a 
disadvantage similar to Jaffé’s. The total number of ions up to the 
terms of first order of this method is 1 — (g/2)InM . Using the same 
example as in discussing Jaffé’s theory we find that it would take a 
time T = e** — 1 to reduce to 10% the total number of ions produced 
in air by an 1 MeV a-particle. This is an inacceptable result for the 
same reasons as before. The situation is not improved by considering 
terms of 2nd order in the expression of N (t)/N, . Although this meth- 
od is not appropriate for calculating the total number of ions it yields 
convergent results for concentrations around the primary track if 
g << 1. For instance, for the previously encountered value of T= 9, 
i.e., M = 10, we have 


f(0, T) =0.1 — 0.1479 + 0.4339? + ----. 
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Following a suggestion made previously (Bull. Math. Biophysies, 
13, 61, 1951), it is assumed that every individual has both a tendency 
to behave arationally, by accepting everything on faith, and rationally, 
by subjecting everything to rational analysis. Arational behavior is 
characterized by various beliefs, prejudices, etc., which are considered 
to be conditioned reactions, learned by the individual before he com- 
pletely develops his faculties for rational thinking. The two tendencies 
are assumed to be due to excitations of two different regions of the cen- 
tral nervous system, and are measured by the intensities e, and €, of 


those excitations. Those intensities are further assumed to increase lin- 
early with time, the increases of the two beginning, in general, at differ- 
ent ages. The rates of increase are considered as normally distributed 
in the population.The relative frequency of arational and rational be- 
havior is determined by the difference ¢ = &; = &, according to equa- 
tions developed previously (Bull. Math. Biophysics, 11, 255, 1949). 

It is shown that with the above assumptions the majority of the pop- 
ulation, which starts with arational behavior, will, within two or three 
of generations, either change to rational behavior or continue indefinite- 
ly to behave arationally. This will hold as long as imitative factors are 
present. Expressions for the numbers of individuals who behave ra- 
tionally and arationally are derived. If the intensity of conditioning to- 
ward an arational behavior decreases with increasing size of the ration- 
ally behaving minority, or, if the rationally behaving individuals are not 
influenced by imitation, then a slow secular trend toward rational be- 
havior may be present. An expression is also derived for the fraction 
of individuals who behave rationally as a function of age. This frac- 
tion increases with increase of the age at which the beginning condi- 
tioning toward any beliefs or prejudices begins. 


In this paper the attempt is made to further develop the sugges- 
tions made in a previous publication (Rashevsky, 1951b) and to de- 
rive in closed form some expressions which, in principle at least, can 
be verified by observation. The notations used here are somewhat 
different from those used in loc. cit. 

As before (Rashevsky, 1951a,b), we consider a neural mechan- 
ism in which there are two cross-inhibited centers, one for behavior 
based on acceptance of everything on faith, the other for behavior 
based on rational analysis of everything. Each center is character- 
ized by a level of spontaneous excitation, «; for the former, ¢, for the 


latter. The difference 
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p— Ef — &r (1) 


determines the relative frequency of one behavior or the other, ac- 
cording to previously developed equations (Rashevsky, 1951la). 

We again consider acceptance on faith as a conditioned process. 
Various beliefs, prejudices, and religious views are of the nature of 
conditioned reflexes. If strongly conditioned at an early age to a spe- 
cific religious belief, an individual may have difficulty in abandoning 
this early established belief, even if he later develops critical reason- 
ing abilities to a high degree. 

Some beliefs or prejudices may be more strongly conditioned 
than others. Therefore, as in loc. cit., we shall assume different val- 
ues of «, for different situations. 

Again, as in loc. cit., we shall assume that the quantity e, , which 
measures the strength of conditioning, increases linearly with time, 
beginning at a certain age 7 at which the conditioning is started; or, 
if we prefer, at which the indoctrination of the belief or prejudice 
is begun. The rate of increase is proportional to a value «, which 
measures the innate ability for conditioning or learning, and to a 
factor b, which measures the intensity of the indoctrination. Thus, if 
an individual is born at the time 7 and the conditioning begun at the 
age 7), at any time t we have 


e, = be. [t — (7 + to) ]. (2) 


As to the quantity e,, we assume that it increases linearly with 
time, beginning with a certain age 7*, so that, with 4 as a constant, 
we have 


é, =A[t— (7 + 7*)]. (3) 


Both the assumption of linearity and the assumption that the 
development of rational thinking begins at a sharply defined age 7* 
are crude approximations. Subsequently we shall outline a possible 
generalization which will make the situation more realistic. 

We shall for the time being consider the case in which 


to <7", (4) 


though subsequently we shall drop this restriction. 

We shall write $(t; 7, 7, 7*) instead of ¢ to indicate that ¢ is a 
function of ¢ and of the parameters, 7 , 7), and 7*. When necessary, we 
shall use a similar notation for e; and e, . 

Equations (1), (2), and (3) give, fort >7++7°: 


b(t5 7,19, 7°) = beo(t—71— 7) —A(t—7—7"), (5) 
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which may also be written: 


p(t;7,70, 7°) = beo(r* — 719) + (be —A) ((—7—7"). (6) 
For 7 +7 <t<7+ 7° the quantity e, is zero, and therefore, 
¢ = be. (t —7r— 7). (7) 


Finally, for; <t¢<r+7,¢=0. 

In a large population consisting of N individuals, the quantities 
é) and 4 vary from individual to individual, as does the quantity b. 
We shall, however, consider 6 as the same for all individuals, as a 
first approximation. As will be apparent from the subsequent devel- 
opments this does not mean a loss of generality. 

Consider the case in which the distribution of both e and 4 is 
normal with corresponding means e and 4 and standard deviations 
Oo; and o,. 

In a group of individuals born at z the mean value ¢ of $(t;7, 70; 
7*) fort >7 + 7* will be [cf. eq. (6)]: 


b(t; 7,70, 7°) =be(r*— 1) + (b& —A) (E—1—7’). (8) 
For7+7<t<7+7* we have 
| $=ba(t—r—2), (9) 


and for; <t<r+m,¢=0. 
For the standard deviation o,(t) of the quantity «; we have for 
De PSP ak 
oi(t) = bo;(t—1r— 7). (10) 


Similarly, for the standard deviation o.(t) of ¢, we have for 
Gens ts: 


C2 (t) =o,(t—r— 7"). (11) 
For t > + + 7* the standard deviation o of ¢ is given by 
o*(t) =o,?(t) + 027(¢). (12) 


Of all the individuals in the society at the time ¢ we shall con- 
sider now only those who were born between 7 and r + dr, or, as we 
shall say for brevity, those born at the time 7. 

The values of ¢ of those individuals will be distributed normally, 
the average ¢ being given by expression (8) or (9) and the stand- 
ard deviation by expression (12). Let N(r)dz be the number of in- 
dividuals born at the time +. Furthermore, let f(7,t) represent the 
fraction of individuals born at the time 7 and surviving at the time 
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t. Then the total number of individuals born at the time 7 and still 
living at the time t is N(7)f(7, t)d7. Of those individuals a number 


Pld Te), axe [p(t;7,70,7°) —$]* 
V 22 oa (t) 2[o,?(t) + o2?(t) ] 


will have a ¢ between ¢ and ¢ +d¢. 

To obtain the total number of individuals at the time ¢ which has 
a @ between ¢ and ¢ + d¢ regardless of when they were born, we 
must integrate expression (13) with respect to 7 after substituting 
the expressions (8), (9) (10), (11), and (12) for $(t; 7, 70, 7°), 
o,(t), o2(t), and o(t). The integration is to be taken from 7 = 0 to 
7=t,if r= 0 is the moment at which the social group is first formed. 
Since $(¢; 7, 7, 7") is not represented by a single analytical func- 
tion in the whole interval (0,¢), we must integrate expression (13) 
from 0 to t — z*, using expressions (8), (10), and (11). From ¢ — 7° 
to t — rt) we integrate using expressions (9) and (10) and putting 
o, = 0. Finally, in the interval (t — 7, t) we have $ =o. = «..=—0; 
that is, in that interval the integrand is zero. 

Unfortunately the evaluation of these integrals in closed form 
seems to be impossible. We shall therefore use a rather crude ap- 
proximation, which will not alter the essential characteristics of the 
relation sought or the order of magnitude of the results. 

We shall assume that the distribution of ¢ at the time ¢ for the 


whole population is normal with a mean value $(t) of ¢, which is 
equal to the time average of ¢ in the interval (0,t), and with a stand- 
ard deviation « which is given by 


do (13) 


o? = 94? + o4?, (14) 


where o, and o, are time averages of o,(t) and o2(t) in that interval. 
It is more reasonable to take the time averages o,? and o.? of the 
squares, but the final expression is not changed much by this. 
In computing these time averages we shall assume an exponen- 
tially increasing population, so that 
N (r) = Noe, (15) 
and put 
f(r, t) =e, (16) 


Remembering again that $(t, +, 7, 7°) is given by different 
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expressions for 0 =< {\<i. ropa ty < <r and t > + + +* 
and putting 

at+p=y, (17) 
we find 


$(¢) =( ev" 8 [bee (r*— 10) + (b&—1) (tr — 2") Ide 


fe (18) 
ee a ear 
t-T* C0) 
Evaluating the integrals, we find: 
$(t) =] Sern) (err Cy =x) 
beg — 2 
ig 5 ev(-7") — 4 (t—7*) —1 (19) 
be 
ee ee viz. —7s) ler | ey i evr —1]. 
y- 


With increasing ¢ the quantity n (t) tends asymptotically to: 
rag A 
Pee + Be e-Y7, (20) 
Y Me 

If instead of considering 7) < 7*, we take 7 > 7*, all we have to 
do is to interchange 7* and 7 everywhere in the preceding equations. 
Equations (24) and (25), and all following from them, remain un- 
affected by this. 

If we consider plausible values for a, f, and y to be of the order 
of magnitude of 10 year, then inspection of expression (19) shows 
that the asymptotic value is very closely approached when ¢ > 1/y ~ 
100 years. Thus, if all the parameters are constant, the value of ¢. , 


given by (20), will be reached within 2 or 8 generations and will 
remain constant and independent of the initial conditions at 7 = 0. 


The maximum value that 6 could have at an early time is be, (r* — to) 
> 0.A bias is then present in favor of arational (Rashevsky, 1951a, 
chap. xxi) behavior, characterized by beliefs and prejudices. If 


Ae beget), (21) 
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then Se is also positive. The bias for arational behavior persists in- 
definitely. 
Only when 
Abeer (22) 


is a < 0, and the bias becomes in favor of rational behavior. 

If we apply the theory of imitative behavior (Rashevsky, 1951a) 
to the present case, we would expect that, provided N, is large enough 
in its initial stages of development, society will exhibit predominantly 
arational behavior. If after a few generations a negative asymptotic 


ae is approached, society will swing into a predominantly rational 
behavior, provided |oo| > |¢*|, where ¢* is the necessary threshold. 
This requires 


A> y \p*| ev + De ev), (23) 


Expression (23) shows that, if the society is originally in a con- 
figuration which corresponds to arational behavior, even if b= 0, 
that is, if no prejudices are taught, it still will remain indefinitely in 
that configuration unless 


Aye er". (24) 

We shall now proceed to find a simple expression for ¢*, as well 

as expressions for the number X and Y of individuals who behave 
arationally and rationally. Before doing this we might remark that 
by an argument similar to that which led to expression (20), we find 


the standard deviations o; (é) and a(t) also tend to asymptotic val- 
ues: 


= __ be; TZ Cr 
a eae 


Tic Cpei ans (25) 
He fe 
The asymptotic value of c is given by 
Bai 2— 1 b2 2 + 2 
To" = 2 (b? o;? + 4,7). (26) 


As has been shown previously (Rashevsky, 1951a, chap. xii), the 
behavioral configuration of the society is determined by the point of 
intersection of a certain curve and a straight line ay. It has been 
shown by H. G. Landau (1950) that, if the distribution function of 
¢ is normal with the mean ¢, and standard deviation ¢, and if the 
distribution of the random fluctuations in the central nervous sys- 


/N. RASHEVSKY 295 


tem (Rashevsky, 1951a) is also normal with a standard deviation 
s , then the curve is represented (Landau, 1950) by the function 


Tie (ieee ae (27) 
verte 
where 

G(x) = (2n)"! | exp (—a?/2) da’, (28) 


it) 


and where N denotes the total population. 

As we have seen in a preceding paper (Rashevsky, 1951c), if N 
is very large a constant independent of N should be substituted for 
it into equation (27) and all the following ones. This constant will 
be absorbed into the constant A. The quantities X and Y, to be used 
subsequently, now represent the fractions of the total population 
which exhibit arational and rational behavior respectively. 


The threshold value ¢* is that value of ¢) for which the straight 
line ay becomes tangent to the curve represented by the function (27) 
(Rashevsky, 195la, chap. xiii). This requirement gives us the fol- 
lowing two equations: 


a=2A eye Sea 
Tae (29) 
and 
— 
eee AG a (30) 
Vi eee 


Because of (28), equation (29) gives us 
A *)2 
Or ss exp | UT : (31) 
\/ 2n (8? + 0?) S*toF 


which in turn leads to 


(3? + o) log 


2A ; 
(yp + ¢*) = | s 


av/2n(s? + 6?) 


Introducing (30) into (32) we now find for the abscissa of the 
point of contact , 


2A 
ee a Like ae (33) 
a wal ore (2 +o) 
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Introducing this expression for y back into (32), we find 


- 2A | 
g° =| (s? + o) log —————_ | 
av/2n(s? + 0°) | 
(34) 
2A — 2A | ) 
——— Gf } log ———____ 
( av/2a(s? +o) | 


As A increases G tends to unity, while the absolute value of the 
first term of the right side increases indefinitely. Therefore ¢* in- 
creases indefinitely with A, as Vlog A. With increasing s or o the 
first term of the right side again increases, while G tends to — 1. 
Thus, on the positive branch where ¢* is negative, it goes to —o as 
either s or « increases indefinitely. 

Introducing expression (34) into inequality (23), we obtain the 
condition that a society would within a couple of generations adopt 
a preponderantly rational behavior. Since none of the known soci- 
eties behave preponderantly rationally, because superstitions, re- 
ligious creeds, and prejudices of various kinds are still characteris- 
tic of the majority, we may draw the conclusion that inequality (23) 
is never satisfied. 


We shall now derive an expression for the fraction X of indi- 
viduals who exhibit arational behavior after society has approached 
the stationary state which is characterized by the values given by ex- 
pressions (20) and (26). To do this we must solve equation (30) for 


y when ¢. = ¢~ and o =o. . Since the G() function is similar in its 
general behavior to the function 1 — e*, we may use the method of 
successive approximations suggested previously (Rashevsky, 1951a, 
pp. 91-2). We shall consider the case in which a is sufficiently small 
and therefore X is close to unity, and we shall use only the first ap- 
proximation, which is obtained by putting y = A/a in the right side 


of (30). Thus 

ie cor ¥e) A+ a bx 

hee ee . (35) 
@ Nas toe? 


If Y is the fraction of individuals who exhibit rational behavior, 
then (Rashevsky, 1951a, chap. xiii) we have 


ee Yoe= I (36) 
and 
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ay 


A—Y= (2 
A (37) 
Equations (36) and (37), together with (35), give 

1 At+ag, 

Kap [1426 Bee ee (38) 
as? + 6,2 
and 
1 AY le a a 

== = ee : (39) 

aS? F007 / 


As ¢,, tends to + pone l). Y = 0 As A — co we also have Xx 1 


Since both Fe and o.. are approached within a rather short time 
(of the order of one generation) after a few generations, regard- 
less of the initial condition, the fractions X and Y become practically 


constant. If peo is either positive or not sufficiently negative, then, if 
arational behavior has been adopted initially by the society as a 
whole, it will persist indefinitely, unless inequality (23) is satisfied, 
which, as we have just concluded, is not likely. 

The situation, however, becomes different if we take into ac- 
count possible variations of b, which measures the intensity of in- 
doctrination with a given belief or prejudice. This intensity may be 
considered on the average as being the smaller, the larger Y, the rela- 
tive number of individuals who behave rationally. Any individual is 
exposed at the age 7 both to the effects of the fraction X of arational- 
ly behaving individuals and the fraction Y of rationally behaving 
ones. It is true that some individuals, born in an arationally behaving 
family, may be more shielded from the rationally behaving individ- 
uals than others. But on the average it is a fair approximation to put 


b= bo (LAY), (40) 
where b, is a constant. 


Now let 2 > be,, so that in the course of the first few genera- 
tions a (t), while remaining positive, decreases. This decrease in $(t) 
will result in an increase of Y. Such an increase in Y will result in a 
decrease of b , and therefore in an increase of 4 — be,, and hence in 
a more rapid decrease of ai This further enhances the increase Oley. 
We shall now investigate whether under those conditions society will 
eventually adopt a preponderantly rational behavior. 
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This process of decrease of b is a very slow one, at least when 
Y is very small as is the case in initial stages. An appreciable change 
in b is to be expected only after a time which is much larger than 
1/y. Since « approaches its asymptotic value So approximately with- 
in the time 1/y, in our subsequent argument we may consider that 
the value o. has been reached and remains constant. If the change 
is very slow, so that b does not vary much during the time t, = 1/y, 
we may consider the whole situation as a succession of stationary 
states, and ¢. as a slowly varying function of time. 

The precise formulation of the above idea leads to rather hope- 
less looking functional equations. The following approximate treat- 
ment gives us, however, an answer to our question. We have from 
(40), (89), and (28) 


db ay 


= SS (41) 

xl eeealiea 

= siete ee ee aye 
| Vane how ») 2a? (s? + 4°) |] dt 

The asymptotic value given by expression (20) is approached ap- 

proximately after a time 1/y. As we have seen, the initial value of 


¢(t) is approximately be,(7* — 7) ; the asymptotic value is given by 
(20). If we assume 7* and 71, to be of the order of magnitude of a few 
years, then, because y ~ 10~ year, we can expand the exponentials in 
expression (20), retaining only the linear terms. We then find that if 


(22) is satisfied the increment of o(t) during the time 1/y is ap- 
proximately —(i — be,.)/y. Hence roughly the order of magnitude 
of d $(t) /dt is 


d$e __ B(A—B &) 


Sn ICT < Os (42) 
Introducting this into expression (41) we obtain: 

db A—b.& 1 

dt y V/2n(s? + o2) 


(43) 


ae _ (tag)? 
2a? (s? +o,,2) 
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which shows that b does decrease eventually to zero. 
In equation (48) ¢. is itself a function of b , according to equation 
(20). As b decreases, Pua tends to a negative value — top ev. For 
of 


sufficiently small values of b, when the asymptotic value is almost 
reached, the exponential term in equation (43) remains practically 
constant. In that case equation (43) becomes of the form 


=—K(A—bdea). (44) 
Integrated with the initial condition b = b, at t = 0, this gives 


i A u 
b=— A— (A— Dom) ef | (45) 
& 


According to this equation, b becomes equal to zero for 


1 


(46) 


If condition (24) is satisfied, then sometime before the time giv- 
en by (46), the society will change to a rational behavior. 


A secular trend toward a change to’a preponderantly rational 
behavior may be obtained, however, if we consider the possibility 
that an individual who chooses rational behavior also becomes free 
from imitative influnces, a situation which is psychologically plaus- 
ible. In that case the offspring of the fraction Y of individuals will 
have a zero b and will not be subject to imitation. Then the situation 
will be described by equation (24) [ef. loc. cit.] into which we now 
introduce a for §, and use Y, as given by equation (39), for a. In our 
present notation equation (25) of loc. cit. reads 


“1 — em, i (47) 
N 
where n, is the total progeny of the rationally behaving fraction of 
the population, as distinguished from Y, which is the fraction of in- 
dividuals born of the arationally behaving majority but which adopts 
rational behavior with age. 
The above conclusions hold, however, only for the case of imita- 
tive behavior. If each individual acts quite independently, the situa- 
tion is different. With sufficiently small values of 
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a,c; and o,, a relatively small positive value of A (t)—say, the initial 
value of b &)(r* — ro) —may result in the vast majority of the popula- 


tion choosing arational behavior. If on is still sufficiently positive, 
then most of the population will still exhibit an arational behavior 
after the asymptotic state is approached. However, the quantities »% 


and Y now vary continuously with a . We now have 


x=4/1+2¢(—=— || ; (48) 
\/ Son 


pee =) (49) 
V/ 8? + oa? 


If we now make the assumption expressed by equation (40) and 
repeat the argument following that equation, we find, instead of 
(43): 


db B(A— b &) i ¢ 
Se ek ee (50) 
dt y? \/ 2 8? treo?) 2(8*-t oa) 

Again b decreases to zero. From equation (20) we see that eventually 


$2 becomes equal to — (A/y)e*, which is negative. The majority 
will exhibit rational behavior. 

As a last problem we shall consider the following one. Consider 
at a time ¢ all individuals with an age between 6 and 6 + dé, with 
6 > r*. For all these individuals ¢ will be distributed normally and 
the mean value will be given by equation (8) in which we put 
t—+r=0. Hence it will be given by 


$ = b &(r* — 7) Fas (b & — A) (0 ear) (51) 


The standard deviation, according to equations (10), (11), and 
(12), is given by 


o = Vb? a7? (0 — t0)? + 0,2 (6 — 7*) 2. (52) 


pen If there is no imitation in the group, then the fraction Y of in- 
dividuals who at the age @ behave rationally is given by 


v=4|1—20( b &(7* — 7) + (b &— 4A) (6 — 7°) Cx 
V+ BF oF (O10)? FoP(O—T)/] 
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In case of imitative behavior, the value of ¢ for each individual 
is increased by the amount y* given by expression (35) (Rashevsky 
1951a, chap. xiii). Hence ¢$ is increased by the same amount. That 
amount is determined by the dynamics of the whole population of N 
individuals of all ages, of which we now consider a subset of age 6. 
Hence 


aD AON doe ean ar 6) = b e—A ar ip 
Yee pa—2e(S ae pre! a | (54) 
MSU oF (6 ==r5) * =e, (0 —7") 


Equations (53) and (54) give us, for the two cases, expressions 
which can be verified by sociological observation. The particular type 
of arational behavior may be, for example, religious creed. Then Y 
gives the percentage of agnostics at the age 6 as a function of other 
observable quantities. 

If 2 > be, then Y increases with 6. The smaller 7, for a fixed 
7, that is, the earlier religious education begins, the smaller Y at a 
given age @, according to equation (53). The same follows from 
equations (54) and (35). 

As we said at the beginning, the assumption of a linear increase 
in ¢; and ¢é, is not very realistic. More likely both quantities reach a 
saturation value and may be better approximated by 


p= K, (1 —e"™"); 2, = K, (1— e*"*) ; (55) 


where the parameters K;, k;, K,, and k, are distributed normally 
or otherwise. If the distribution of those parameters is known, the 
derivation of the distribution of «; — ¢, with ¢t as a parameter is 
straightforward but leads to computational difficulties and does not 
lead readily to closed expressions. This problem should, however, be 
considered as the next step. 

The author is indebted to Dr. George Karreman for a thorough 
critical discussion of the paper and for the correction of several 


errors. 
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A method of obtaining better fits to experimental curves for the 
hydrolysis of sucrose in the presence of invertase is found by assuming 
that the reaction is first-order, but that there is an initial time delay be- 
fore this form is assumed. More rigorously, a system of two differential 
equations describing the reaction is established, assuming that the for- 
mation of enzyme-substrate complex is irreversible, and a solution is ob- 
tained relating, at any time, the concentration of the products, the total 
concentration of complex which has been formed in that time, and the 
time. This solution is compared with other equations for the reaction 
and is used to calculate reaction constants. Curves showing the concen- 
tration of complex existing at time t are given and are employed in cal- 
culating an “equivalent”? enzyme concentration. This expression is dis- 
cussed. 


There have been many attempts to analyze the kinetics of the 
action of invertase on sucrose. The earlier work has been summarized 
by R. Kuhn (1925) and J. B. S. Haldane (1930). These analyses are 
based mainly on the theories of L. Michaelis and M. L. Menten 
(1913), although several empirical formulations have also been made. 
H. Lineweaver and D. Burk (1934) have used this earlier develop- 
ment in their determinations of dissociation constants by their re- 
ciprocal methods. Some recent discussions of the kinetics of enzyme 
reactions are given by P. W. Wilson (1949), A. E. Stearn (1949), 
J. B. Sumner and K. Myrbick (1950) and E. A. Moelwyn-Hughes 
(1950). 

During the course of experiments on the effects of mercurials on 
invertase activity by C. L. Gemmill and E. M. Bowman (1950), con- 
siderable data was obtained which lent itself to mathematical analy- 
sis. This analysis will be divided into three phases: (1) determination 
of the over-all reaction velocity; (2) determination of the velocity of 
the enzyme-substrate reaction; and (3) a method of calculation of 
equivalent enzyme concentration. 

Determination of Over-all Reaction Velocity. It is generally held 
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that the hydrolysis of sucrose in the presence of invertase proceeds 
in accordance with the Michaelis-Menten equation. However, it is 
still considered (Sumner and Myrbiick, 1950, p. 26) valid to use first 
order equations to calculate an over-all velocity constant for reac- 
tions which approximate closely to monomolecular reactions. That is, 
for the sucrose-invertase reaction, if y is the concentration of prod- 
ucts at time t, and if S, is the initial concentration of sucrose, then 

d 

pK So)» (1) 


from which we obtain: 


So 
So—y 


+ 
y=—S,.(1—e*'), or boat (2) 


This function of time gives a curve which does not fit the curve 
obtained by experiment, as indicated in Figure 1. Further, for K 
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FIGURE 1. Comparison of monomolecular curve (dotted line) with curve con- 
structed using the 6 assumption (solid line) for a sucrose-invertase experiment 
with S, = 12 per cent. Ordinates, per cent sucrose inverted. Abscissae, time in 
minutes. K = .0146, 6 = 4.2 mins. 


taken as the mean of the values calculated using expression (2), it 
was observed that the theoretical curve crossed the experimental 
curve in all cases, being above the latter initially. If the experimental 
curve were exponential, with S, as its upper bound, this would occur 
only fort= 0. 

For these reasons a time lag of 6 minutes from t = 0 tot = 6 
was assumed to occur before the reaction assumed the exponential 
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form..With this assumption, better fits to the experimental data were 
obtained, as indicated in Figure 1 for one experiment. This assump- 
tion takes mathematical form as a change in the limits of integra- 
tion of (1). Thus, fort = 60, 


t=t y 
a > dy S 
K | at= | Brea pas 
6 Soy te Se Y f ig 


0 


while, forO <t<0o,y=0. 
Expression (3), which is linear in the variables ¢ and 


In(S./So— y), 


is in convenient form for application of the method of least squares 
to calculate the numbers K and @. The results of such calculations 
for a series of eight experiments are given in Table I. 


TABLE I 


CALCULATIONS OF K AND 9 BY MEANS OF EXPRESSION (8); 
DATA OF GEMMILL AND BOWMAN (1950) 


oy ENZYME DRUG TEMP. K (Min-1) 4 (min.) 
ola Invertase None 29.7°C. .0508 3.1 
6% Invertase None 29.7°C. .0318 2.6 
12% Invertase None AD OF -0146 4.2 
6% Invertase Ascorbic PRL ACr .0227 0.7 
Acid 
6% Invertase HgCl, 29.7°C. 0128 2.7 
6% Invertase None 2 inG: .0207 4.0 
6% Invertase HegCl, 21.1°C. .0093 5.8 
6% Invertase HgCl, 21.1°C. 0090 4.5 


The introduction of the time @ in the equation has the practical 
value of obtaining a better fit of the curve to the experimental points. 
It has the theoretical value of showing that there is a finite time after 
the substrate is in contact with the enzyme before a measurable in- 
crease is observed in the reaction products. It also gives additional 
evidence for the pseudo-monomolecular type of reaction taking place 
in this case. 

Determination of Velocity of Enzyme-Substrate Reaction. The 
observation that the assumption of a time lag gave better fits to ex- 
perimental curves is an empirical method. However, it is of such a 
nature as to indicate that the appearance of the products begins with 
zero velocity. First order reactions following equation (2) begin 


306 KINETICS OF INVERTASE ACTION 


with a velocity equal to KS, , which is not zero. Hence, further analy- 
sis was made of the action of this process. 
The assumptions used subsequently are: 


1. The reaction is described, as usual, by 
E-+S2ES > E+ P. 


The rate of formation, at any time t, of the products, P, is propor- 
tional to the concentration of the intermediate complex, E'S , at that 
time, and the rate of production of E'S is proportional to the product 
of the concentrations of # and of S. 


2. The velocity constant for the reverse reaction, ES ~H +S, 
is negligible compared to that of the direct reaction. 


3. There is no inhibition by the products of the reaction. 
4. The enzyme regenerated is all active. 


Let x be the total concentration of HS which has formed in time 
t, and let y be the total concentration of products formed in time t. 
Because of assumptions 2, 3, and 4, we have that at time t, x — y is 
the concentration of ES , HE, — (x — y) is the concentration of free 
enzyme, and S, — « is the concentration of substrate. Then, using 
assumption 1, the equations describing the course of the reactions 
are: 


dy 
Spimmes eee (4) 
and 
a Re BEG (SERIE (5) 
dt 


The equation for the overall reaction would be obtained by in- 
tegrating these equations simultaneously. This has not been com- 
pletely done. However, the single equation resulting from these by 
the substitution, y — S, = z, is 


20h a bsys- ses? le (6) 


where a= K, + K,E,, b=K., c= K2/K,, andd = K,K.E, . 
By the usual substitutions, p = z’ and p(dp/dt) = 2", equation 
' (6) becomes 


dp 
pa, + (a—bz)p— ep? + dz=0, (7) 
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which is a form of Abel’s equation (Sylow and Lie, 1881) for which 
no solution in terms of elementary or tabulated functions has yet 
been given. 

A solution, y = f(x, t), is easily obtained, however. Into equa- 
tion (5) we substitute x — y = (1/K,) (dy/dt), from equation (4). 
This gives us 

BO ip way dy 
Ks(Ss;— 2x). dt Kes iss 
which is directly integrable. Using the initial conditions x(0) = 0 
and y(0) = 0, we obtain 


1 0 
—] =f t—-—y, 
Kk, err : Ras 
or 
K, So 
= t——] ; 8 
y—K,E, K, aes (8) 


This expression demonstrates the relation existing between the quan- 
tities x and y, but does not give the actual curve y as a function of 
t. Interesting results may be obtained, though, by the application of 
this equation to experimental results, as will be shown in a later 
paragraph. 

The term K,E,t in equation (8) might create the impression that 
enzymatic reactions approximating the assumptions made here are 
initially of zero order; i.e., that at the beginning of the reaction the 
production of y proceeds linearly. However, the initial slope of the 
function y(t) is necessarily zero, since, by assumption, 

dy _ a 4 
di K,(%— y) (4) 
and ~(0) — y(0) = 0. 

From the above point of view, an assumption that sucrose inver- 
sion under the action of invertase is a monomolecular reaction is 
equivalent to an assumption that «(t) represents the total concentra- 
tion of products formed in time ¢ and that y(t) is identically zero 
throughout the reaction. Under these circumstances, the solution (8) 
becomes 

Ky So 


= Kg 21 
Cee eRe LS, 
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or 
x = So(1 — e-B2Hot) , (9) 


which is the ordinary monomolecular expression with K = KE). 
This fact explains the observation that K, as calculated from the 
monomolecular expression, is proportional to the initial enzyme con- 
centration. 

The time-action function given by G. E. Cullen and D. D. Van 
Slyke (1914) (and Van Slyke, 1942) is 


viata Saw ll 
Keo a—az Ke 


t (10) 
where Kg is the reaction constant for formation of the enzyme-sub- 
strate complex, Kp is the reaction constant for the hydrolysis, and x 
is the total concentration of the products. In the notation used here, 
this becomes 


1 So y 
a + 
Ke aay Key 
or 
K, So 
= K,t—_— |] : 11 
y 1 K, assy (11) 


This equation, except for the coefficient of ¢, is what the solution 
(8) becomes for large ¢ , since then x is approximately equal to y, the 
concentration of HS existing at time ¢ approaching zero as ¢ in- 
creases. In that case, (8) becomes (for large t) 


kK, So 


= K,E,t ——I1n : 
y 14/0 K. Soy 


(12) 


The discrepancy is accounted for by examination of the develop- 
ment of formula (10). If Ke and Kp are the quantities stated above, 
then Ke must have units of (moles? X min.) and Kp must have 
units of (min.*). Then the “time required for one cycle,” (1/K,S) + 
(1/Ka), which occurs in the Cullen and Van Slyke development, is in 
units of time, whereas it should be in units of time per mole, or equiv- 


alent mole (see below), of enzyme. This is indicated more clearly in 
their equation, 


OS ite Sal Le ines 
dt  (1/KoS) + (1/Kp)’ 
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in which the left side has units of moles per unit time and the right 
side has units of reciprocal time. Hence, it is seen that their Kg is our 
KE, and their Kp is our K,E,. Therefore, in our notation, equation 
(10) is really 
K; So 
= K,E,t —— In : 
Z ety Ke So ty 


which is the same as equation (12). 

Since equation (12) is valid for large t , it may be used to deter- 
mine the values K,H, and K,/K,, by substituting into the expres- 
sion experimental values for y. For large t, though, experimental 
errors are more significant, since then the time-action curve has be- 
come flat. Therefore, care must be observed in the application of 
equation (12). 

i100 
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FIGURE 2. Upper curve: total concentration, x, of complex which has formed 
in time t. Lower curve: concentration of products at time t. Ordinates, per cent 
sucrose converted. Abscissae, time in minutes. 


As an example, using the experimental results obtained in the 
experiment whose graph is given in Figure 1, the following evalua- 
tions were made: 


K, 
K,E, = .0105 (moles/min.) , x. .547 (moles) , 


and 
K,E, 
K,/Ke 


The solution (8) may be written 


= .0192 (min.*). 
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2 = Sy(1 — eB Sot-/ i) ) | (13) 


Using the constants calculated above and the experimentally obtained 
values for y, this function gives the total concentration of the complex 
ES which has been formed in time t. The curve thus obtained resem- 
bles more closely the curves previously thought to represent the time- 
action curve for the formation of products of the reaction. Figure 2, 
using y values from the approximating curve obtained by the method 
of part (1), shows how the z-curve compares with the y-curve in 
another experiment. 

A second method of obtaining values for the constants is pos- 
sible. Since the assumption of a time lag, 6, gave curves which fitted 
the data accurately, it may be assumed that the expression (2) rep- 
resents the experimental curve for large t. The expression (12), re- 
written, becomes 


y= So (1 — e-K2(Hot-y/K1) ) , 


which is also valid for large t . Hence, the exponents must be approxi- 
mately equal, or 


Ky 
K,E,t —— =K P= . 
x! (t — 6) 


This is a linear expression, 


K, K,K 
arc (KE, — K)t — K 


‘2 
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FIGURE 3. Concentration, « — y, of sucrose in the complex at time t. Upper 
curve: S, = 12 per cent. Lower curve: S, = 6 per cent. Ordinates, concentration, 
of sucrose in complex. Abscissae, time in minutes. : i 
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But for large ¢t, the curve is flat, dy/dt = 0., or Koby = K. There- 
fore, if K is accurately calculated in an experiment which is carried 
over a long period of time, it has approximately the value K.E,. 

For the experiment whose graph is Figure 2, this method gives 
KE, = K = .0318. This experiment, with S, = .1512 M , was carried 
out long enough for y(t) to level off. Hence the data at three large 
values of ¢ were used in the expression (12) to obtain an average 
value K,/K, = .576. 

In Figure 3, the curve « — y, representing the concentration of 
sucrose existing in the complex E'S at time ¢, has been plotted for 
the two experiments above, using the curves of part (1) as the ex- 
perimental y-curve. B. Chance (1943) devised a method for direct 
observation of the curve « — y for the peroxidase-H,O.-reaction. The 
curves obtained by the methods described above show an obvious 
similarity with respect to shape to those reported by Chance. In the 
same article, the G. E. Briggs and Haldane (1925) equations, which 
are similar to those given here, have been integrated by means of a 
differential analyzer, without the assumption made by them that 
d(z — y)/dt is always so small that it may be considered to be zero 
throughout the reaction. 

Method of Calculation of Equivalent Concentration of Enzyme. 
Using values obtained for x — y, as a function of t, the value of K, 
may be calculated according to equation (4). The slopes, dy/dt , must 
be obtained from the graph of y as a function of t. Assuming a con- 
stant error of observation, the following values were found in the two 
experiments whose graphs are given in Figures 1 and 2. In the first, 
K, = .2165 (min.-) and, since K,E, = .0183, we have EL, = .0479 M 

If the molecular weight of invertase is around 50,000 this value 
of #, leads to an unreasonably high density for the invertase prepara- 
tion used in these experiments. Therefore this figure cannot repre- 
sent the actual initial concentration of the enzyme. It may, how- 
ever, represent an equivalent concentration in the sense that, if one 
molecule of enzyme combined with a large number, 7, of substrate 
molecules, rather than as postulated in part 2, then the number found 
above would represent n times the actual initial concentration of in- 
vertase. To obtain the actual initial enzyme concentration knowledge 
of the magnitude of n is required. 

It may be that before any of the ES complex is hydrolyzed a 
certain fixed number, 7, of substrate molecules must combine with 
an enzyme molecule. In that case, one would expect an actual time 
~ Gelay as postulated in part (1). It is much more likely that the num- 
-ber nis a statistical figure, namely, the average number of molecules 


312 KINETICS OF INVERTASE ACTION 


of substrate which are associated with one molecule of enzyme in the 
complex. In the latter case, one would probably expect the initial zero 
velocity as developed in part (2). 
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This paper deals with an analysis of a distributed model of the 
game of “gossip,” in which a message is passed through a line of in- 
dividuals, and the final (in general, garbled) result is compared with 
the original ungarbled message. The deterioration of information (de- 
fined in the sense of Shannon and Wiener) along the line is calculated, 
and exact as well as asymptotic formulas suggesting approximate lin- 
ear electric network analogues are obtained. 


I. Object of this Paper. In discussing communication nets in- 
volving human beings and/or other automata one is tempted to call 
upon analogies from the theory of linear electric networks to describe 
the mutual exchange of information in terms of analogous current 
or voltage propagation. In this paper we consider a particular type 
of communication system, and investigate the approximations un- 
der which for that particular case an analogy can rigorously be 
drawn, using the strict definition of information proposed by Shan- 


non and Wiener.* 


FIGurE 1. Fundamental Communication system. 
Il. The Theory of Information. Consider a communication sys- 
tem (Fig. 1) described by the square n X n matrix 


P(i, j) = joint probability that the ith symbol is 
sent, and, as a result of this, the jth symbol re- (1) 
ceived. 


We shall define 


*See list of references at end of paper. 
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Se Ga eens > P(i,7) =a priori probability of 
Jez 


(2) 
sending symbol 7, 
OG y=] > P(i,7) =a priori probability of 6 
receiving symbol 7 , 
Q:(7) =P(i,7)/P() = conditional probability 
= so ee. (4) 
that 7 is received if 7 is sent. 
It is evidently necessary that 
= Qi(7) =1, (= 1,2,----, Nn). (5) 
7 


Shannon has shown* that the information rate of the communi- 
cation system of Figure 1 should be defined as 


R=H—E, (6) 


where 


H——Y Q(j) logQ (J) is the negentropy of 
fet 


ft 
the received symbols} (7) 
| Se > P(i,7) logQ:i(7) is the equivocation P 
i,j=1 8 


of the system 


In the special case when all the symbols to be sent are equally 
likely we have 


Pi lin. (9) 
Therefore, P(t,7) = (1/n)Q; (J), giving 


E=— (1/n) Px (j) log Q:(j). (10) 
Also, if in addition, Q;(7) = Q;(#) 


Qi) =E PWD AG) = C/n) » Q:(7) = (1/n) (11) 
according to (5). 


*Cf. Shannon, 1948a, p. 409; also see Reich, 1951 for a discussion of the 
arbitrariness of the choice of the definition of information. Shannon uses H (y) 
for our H, and H,(y) for our E'. Shannon uses logs to the base 2; we employ 
natural logs. 


+The term “negentropy” is due to Brillouin, 1950. 
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In this particular case, therefore, H = logn, so that 
k= logn + (1/n) & Q:i(j)l0gQi (7). (12) 
V,j=1 


Ill. Model for the Game of Gossip. The game is postulated to 
consist in selecting a symbol (message) at random from a set of n 
symbols, such that all n symbols are a priori equally likely, and pro- 
pagating it down the line of individuals. The participants are as- 
sumed to know what the a priori possible messages are, and to rea- 
lize that they are a priori equally likely. 

We shall let x measure distance from the beginning, or origin 
of the line. 

We now make the idealizing postulate that if the symbol arriv- 
ing at x is the ith symbol, then the probability that an arbitrary 
symbol 7 arrives at x + dz is given by the conditional probability 


matrix 
1— a(n—1) dx ada Ae ada 
[Q: (7) 2e+a2] = adx 1—a (n—1) da ee ada 
ada adx 1—a(n—1) da | 


Here a should be thought of as a number proportional to the density 
of human beings per unit distance along the line. In the realistic case 
distance would be measured as proportional to the number of human 
beings. Therefore a plays the role of a characteristic frailty (inde- 
pendent of n) which must be found by experiment.* 


Symbolically 
Qi (7) 2,e+d2 = (1 a andx ) 0ij ate adxt. (13) 
We shall let 


fi; («) = conditional probability that j will be received 


Foe a (14) 
at x if iis inserted atx=—0. 


Then, using (13) and (5) we find: 


*Although an exact solution for the case of discrete individuals could have 
been obtained the continuous case was pur eoeey. chosen because it is simpler. 
Lite 4 
+5,; is the Kronecker delta: 6, = | pineses 
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fis (a + dt) =3 fa) Qu (A) ave 


= fi;(“) — fis (x) anda + adz. woes 
Thus f;;(x) satisfies the differential equation 
fis (x) + anfi; (a) —a=0. (16) 
The solution, with the initial condition f;;(0) = 6i; is: 
fag (0) 6-0" bgp bh (CL ec) /0. (17) 


IV. Information Flow in the Game of Gossip. According to 
(10), if H(z) is the equivocation in the stretch of line (0, x), 


E(x) =— (l/n) & fas (%) logti; (x). (18) 
i,j=1 
Substituting (17), manipulating algebraically, and inserting into 


(12) gives us as the basic result 


Fn (2) =A TS toe Paine) 


(19) 


ia (8 1) Cee ed 
n 


log (1 — e-@"”). 


_ Figure 2. Information along gossip line as a function of distance from 
origin. 


Here R,(x) is the information available at « regarding the original 
message. A plot of the function is given in Figure 2. As obvious 


checks we have 


lim R,, (2) = logn , (20) 
lim R,.(%) =0, (21) 


R,(x) =0, (22) 
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lim’ #,,(z) =0- when. «+0. (25) 


it is interesting to evaluate the derivative of R,(x). This is 
1 ee en-ane 
1+ (n—1)e 


R,' (a) =a(n—1) e™ log | (24) 


This shows that 


(a) £&,(«) is strictly decreasing as a function of x except when 
a(m—1) = 0 in which case R,,(x”) = const. 


(b) The slope of R,(#) at the origin is infinite when a(n—1) # 0. 

V. Estimates and Asymptotic Relations. The following two re- 
lations are easily obtained from (19). For small z: 

R(x) =logn + a(n—1) wloga + 0 (z?logx). (25) 


For large n, and ax larger than some positive quantity independent 
of n: 


Rg (x) = (8"/2) e2" + 0 (67). (26) 
An interesting relation that can also be obtained is 
calerraetal 
“L a(n—1) 
lim ————— = 1 fe ~ 0.37. (27) 
aa R,, (0) é 


This indicates that for large n if we go out a distance 1/[a(n—1) ] 
from the origin we will find that the information has deteriorated to 
approximately 37% of its initial value. 


VI. Linear Electric Network Analogies. Linear electric net- 
work analogy approximations can be obtained by exploiting either 
(26) or (27). Equation (25) cannot be used because the zlogr term 
prevents it from being the solution of a linear differential equation 
with constant coefficients. 


C=1 FARAD 


INITIAL VOLTAGE = 
LOG a VOLTS 


: Sen ; : ‘iaiee 
Figure 3. Approximate analogy for gossip line having frailty per unit 
tance a, for get n possible messages. Here time is the quantity analogous to 
distance along the gossip line, and voltage is analogous to information. 
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An electric analogy could use either time or distance as the quan- 
tity analogous to #. If distance is used, we have the case of an elec- 
tric transmission line with attenuation. Recognizing that a series 
combination of resistance R and capacitance C has a time constant 
RC seconds (time in which the original capacitor voltage goes to 
1/e = 0.37 of its initial value) a simple lumped analogy making use 
of (27) can be obtained. Namely, consider a capacitance of value 1 
farad and leakage resistance [1/a(m—1)] ohms (Fig. 3). If the 
capacitance is originally charged to logn volts then the capacitor volt- 
age as a function of time is approximately analogous to the informa- 
tion in the gossip line as a function of distance. 


VII. Supplementary Remark. Of the several possible modifica- 
tions of the preceding analysis that come to mind the most obvious 
one is the adoption of a different Q: (7) 2,0. This will simply change 
the differential equation (16), but the rest of the work would be the 
same. One possible example is to make a a function of x. This would 
reflect the effect of relative position in the group on the frailty of the 
individual. 
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The problem of cycles in random neural nets is considered. An ap- 
proximation is derived giving the expected number E(k, a, n) of cycles 
of length k of which a neuron in a random net will be a member. 

The approximation is further simplified for the case n >> k. 
Ey this case E(k, a, n) = a*/n, where (a) is the axone density of 
the net. 


Consider a random net such as described by R. Solomonoff and 
A. Rapoport (1950). The ‘‘weak” connectivity of such a net is de- 
fined as the probability of the existence of a path through any num- 
ber of internuncials between two arbitrarily chosen neurons N, and 
Nz. 

The approximate method developed by Solomonoff and Rapoport 
gives the following equation for y: 


yt = et, (1) 


where (a) is the axone density of the net. 

In particular, if N, and N, are identical y gives the probability 
that there exists some path from N, back to N, through one or more 
internuncials. Therefore, y is the probability that an arbitrary neu- 
ron of a random net is a member of a cycle. 

For the case a = 1, a neuron can be a member of at most one 
cycle. For large n (the number of neurons in the net) the fraction 
of neurons that are members of cycles is vanishingly small, being 
proportional to n°. 

For finite n and a = 1 the probability that a neuron is a member 
of a cycle of a given size has been given explicitly (see Rapoport, 
1948, and Shimbel, 1948). Since in this case the probabilities that a 
neuron is a member of a cycle of m links and v links are mutually 
exclusive, it follows that y is given by the sum of such probabilities. 

However, for a > 1 a neuron can be a member of several distinct 
cycles, some of them possibly having neurons in common. Therefore, 
in this case y cannot be derived as the sum of such mutually exclu- 


sive probabilities. 
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The problem can be approached in a slightly different way. Let 
x‘ be the probability that a randomly selected path of k links start- 
ing with a picked neuron of a set of nm neurons each giving off (a) 
axones is a closed cycle. 

By elementary considerations we find that 


n—-1 n—2 n—k+1 1 


OS ; 2 
dk Ae Pi A . (2) 
From expression (2) we obtain the relation 
n! 
(CD == ———_.,, 3 
PH (n— ky! ‘I 
Using Stirling’s formula on equation (3) we obtain 
n™ 
pe (4) 


in niet (n— k) n+ ek 


Now if we multiply ¢,“ by the total number of paths T of & links 
which can be taken starting with a given neuron we obtain an ap- 
proximation to the expected number E(k, a, n) of cycles of length 
kt which a randomly selected neuron will be a member of. Thus we 
have 


E(k,a,n)=—¢T. (5) 


Moreover the total number of possible paths of & links starting 
with a given neuron is equal to a*. We obtain this expression by con- 
sidering that the “first step” of the path can be taken along any one 
of (a) axones. The second step can also be made in (a) different 
ways so that a path of k links can be made in a-a-aa- (k times) dif- 
ferent ways. 


From this we find that 


mr-k-3 a k 
Haris eaten aie 
(k,a,n) =" —_( 5 (6) 
Expression (6) is only an approximation because the paths of k links 
which can be taken starting from a given neuron are not entirely 
independent. Any two paths in the a* different possible ones may have 
one or more links in common. Also two effectively identical cycles 
(in terms of neurons) may differ in that distinct axone paths may 
exist. For these reasons expression (6) does not strictly follow the 
definition of expected number. 


For the cases k = 2,a = 1 and k = 2,a= 2 the expression 
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ox‘ a* accurately gives the expected number as tested from an exact 
formulation. The accuracy of expression (6) for increasing k and a 
is yet to be determined. 

Note that for n >> k, ¢x‘” is very close to 1/n. This means 
that for n >> k we can write 


k 


E(k;a,n) =. (7) 


In view of the extreme complexity implied by the exact formu- 
lation, expression (7) seems suprisingly simple. Each of the perti- 
nent variables appears only once in the function. 

This investigation is part of the work done under Contract No. 
AF 19(122)-161 between the U. S. Air Force Cambridge Research 
Laboratories and The University of Chicago. 
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A situation is considered in which an individual is given an oppor- 
tunity to risk a certain amount of money or goods in order to gain a 
larger amount, providing the result of some uncertain event proves 
favorable. A neural mechanism is introduced in which the probability 
of success can be ordered according to its value. The response to a situa- 
tion thus becomes dependent upon the probability. An expression is 
then derived for the amount that would be risked in terms of the prob- 
ability and of the amount that would be gained in the event of a success- 
ful outcome. Similar expressions are obtained for the case of insurance 
against loss. Results of questionnaires indicate that individuals can be 
classified according to their pattern of behavior in the situation con- 
sidered. The various types can be most easily recognized when a plot is 
made of the relative amount risked against the probability of success. 
In a general way, these types can be understood in terms of the equa- 
tions derived. 


The situation which we shall consider is one in which an indi- 
vidual is given an opportunity to risk a certain amount f of goods 
for the chance to obtain an amount v, under circumstances in which 
the probability of success is a. Our problem is to derive a function 
f(v, w,a), w being a measure of the quantity of goods possessed by 
the individual.* In order to do this we must first treat certain com- 
ponent parts of the problem. Therefore, we shall discuss a mechanism 
by which the fraction of successes, in a given situation, is abstracted. 
Thus to a given process, which on repeated observation is found to 
have a certain fraction of successful outcomes, there is a correspond- 
ing neural element [at #(a)]; and the correspondence depends only 
on the value of the fraction (a). Another mechanism is then intro- 
duced which makes it possible for the effective value of the stimulus 
(v) to be reduced by any given fraction (a’), the value depending 
upon which one of a series of neural elements N., is acting at the 
same time as the stimulus. Through a process of conditioning, the 


*A better approximation might be a linear combination of the amount pos- 
sessed, the rate of income, and the rate of outflow of goods. 
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elements corresponding to probabilities [at @(a)] become function- 
ally connected to the elements (Na’) which affect the response to the 
stimulus (v). The result is that the effective value of the stimulus de- 
pends upon both the stimulus magnitude (v) and its associated prob- 
ability (a). The problem, then, is reduced to that of a choice between 
two alternatives: taking or not taking the risk. To each alternative 
there corresponds a net effective stimulus value. The risk is taken 
if the stimulus value for this situation is the larger one. 

Mechanism for the Representation and Ordering of Probabil-. 
ities. Consider a test situation which can only result in a successful or 
unsuccessful outcome. Suppose that this situation has been observed 
by an individual to give an successes out of n trials. Let a group 
of neurons M; represent the memory traces from each one of the ob- 
served trials. Let each of these neurons M; act upon the two neurons 
M’' and M” (Fig. 1). For convenience, the neurons M; are grouped 
in Figure 1 in such a manner that the an corresponding to the suc- 
cessful outcomes are together. Due to conditioning the M; correspond- 
ing to successful events will have greater effects upon M’ and M” than 
those corresponding to unsuccessful trials. In Figure 1 this differ- 
ence is indicated by the use of two end bulbs and one end bulb, re- 
spectively. Let the thresholds of M’ and M” be one and two, respec- 
tively. Let Vu. be the average frequency of firing of M;. If the »’s 


FIGURE 1 
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are, on the average, equal to », then the average frequency of action of 
M' and M” will be 


Vg a rt ty hes Hae = yn; (1) 
Yure Tee + My = van. (2) 


Let M’ and M” send collaterals to each element in the center 
A, each sending different numbers in different parts of A. The neu- 
rons in A are arranged linearly so that we may refer to the neurons 
at a position x. Furthermore, we consider the number of collaterals 
reaching A to be sufficiently large so that we may define a density of 
the endings at x. Let the density at x of those originating at M’ and 
M” be Ean(%) and E,(x), respectively. Then the total excitation at x 
is given by anHa,(x) + nH,(x). In general, the total excitation de- 
pends upon x and upon a. If there is a point “ at which the total ex- 
citation is a maximum, this can be found by differentiating the above 
expression and setting the derivative equal to zero. If this is done 
and the resulting expression is solved for a, we find (Z’ = dE/dx): 


a(%) =— EB, (&)/Ean (&). (3) 


In the following we shall assume that the resulting relationship a (2) 
is monotonic, although it is probably necessary only that #(a) be 
single-valued. The elements of A are considered to mutually inhibit 
each other in such a manner that only those elements in the neighbor- 
hood of # are able to transmit (cf. Rashevsky, 1948, p. 377). 


Nz, N Ny 


oan Ca 
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A Mechanism for Altering an Input Frequency by Various Frac- 
tions. Let a stimulus of intensity S, (Fig. 2) result in a frequency » 
of impulses, the frequency depending upon the intensity. If during 
the same period of time a neuron Na is firing at a frequency 1/6, 6 
being the absolute refractory period, then neuron N (Fig. 1), witha 
threshold of 2, fires with an average frequency 


vy = ay (Sy). (4) 


This follows since the fraction of time that Na sends impulses to N 
is, by construction, equal to a. The thresholds are taken to be equal 
to two unless indicated to the contrary. The assumption that the Na 
fire with frequency 1/6 is not as restrictive as it might first appear. 
Although each Nz results in a periodic pattern of impulses, the effect 
is to reduce »(S,) by a fraction a on the average only, since, in gen- 
eral, S, results in a random temporal sequence. 

It is essential that only one Na act at a given time. In order to 
insure this the Na must mutually inhibit one another (not shown in 
Fig. 1) in such a manner that only one element is able to transmit, 
even if several neighboring elements should be stimulated simultane- 
ously. 

The net of Figure 2 gives a spatial correspondence # in A to the 
fraction a of successful events. We will now suppose that there are 
non-functioning connections between each element in A of Figure 2 
and each element S. of Figure 1. Let these connections be of such a 
kind that they can become functional through conditioning. Consider, 
for a moment, two stimuli S, and S. which represent situations with 
probabilities of successes a, and a2. In a situation in which there is a 
choice between these two, the best result, on the average, occurs 
when the stimuli are weighted by their respective probabilities. Thus 
when S, is, by chance, weighted by an amount a, , due to the simul- 
taneous action of S, and Sa, , the result is likely to be more favor- 


able than when S, and some other S. act together. But when S, is 
brought into activity, the neurons in the region about x(a,) become 
active because of the mechanism of Figure 2. Therefore the degree 
of conditioning between #(a,) and Sa, tends to be greater than that 


between %(a,) and any other S.. Under these conditions, then, a 
stimulus S , whose probability of success is a, tends to result in an 
output frequency of value ay(S) instead of »(S). 

In the above argument we have assumed that the failures pro- 
duce no extinction in the conditioning process. If we did not assume 
this, the more uncertain event would be given less weight than its 
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corresponding value a. A more detailed analysis would be required 
to evaluate this effect. 


The Determination of the Function f(v, w, a): Risk for Gain. 
We return now to the situation originally proposed. We shall use an 
expression of the following form for the relation »(S,) : 


v (Sy) =A log(1 + Sy»). (5) 


This type of relation has been commonly used in mathematical in- 
vestigations of the central nervous system and of social behavior 
(Rashevsky, 1948, 1950). The number 1 is used in the argument of 
the logarithm to avoid introducing discontinuities and threshold 
phenomena. 

We shall suppose in the choice between two alternatives that one 
is chosen because of the greater net excitation at some level C. If 
the situation presents a choice of taking or not taking a risk for some 
gain, then, according to our model, we need only compare the net ex- 
citations corresponding to the two situations to determine the outcome 
(Landahl, 19388). In so doing we neglect any random phenomena 
which would lead to distributions of the probabilities of the amounts 
that would be risked under given circumstances. 

In the situation under consideration, failure to take the risk 
would result in the individual’s possessing exactly what he already 
had, that is, w. As the only stimulus is S, = w, in this case we would 
have the following excitation at C: 


A log(1 + yw) = Aalog(1 + yw) + A(1—a)log(1 + yw). 


If the risk is taken and the result successful, an excitation corres- 
ponding to this would be given by equation (4). In (4) a has the 
same meaning as here and v(S,) is given by equation (5), with S, = 
w+vu—f,v —f being the gain. In the event of failure, the excitation 
would be given by equations (4) and (5) with a replaced by (1— a), 
the probability of failure, and S, = w — f, f being the loss. The total 


excitation due to these two possibilities is then: 
Aalog (1 + yw + yu — yf) + A(1—a)log (1 + yw — yf). 

If no other factors are taken into account, then f , the maximum value 

that would be offered, is given by equating the degrees of excitation 

corresponding to these situations. From this we obtain: 

Pe yu 


1+ ywt+ yo —yf 
1+ yw 


1 
ro on 1+ yw 
from which we can calculate f in terms of v, w, anda. 


)+A(1—o) og )=0, ) 
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From our previous discussion it is clear that the weighting by a 
and (1 — a) is only to be expected in the absence of any extinction. 
The more general case can be included if the two coefficients A in (6) 
are made functions of a and (1 — a) respectively. Since a generally 
is small, we may introduce A for the first and A(1 +. 8) for the sec- 
ond to allow, to some extent, for this effect. This could be interpreted 
as meaning that the anticipated gain and loss are not equally 
weighted. 

In addition to this effect we shall consider the effect of the prize, 
and the excitation or inhibition due to the situation. When «a is equal 
to zero or one, there is no uncertainty. Thus an expression of the 
form 


AGS (ae 0) log (1+ )= Aalog( 1+) (7) 
WwW WwW 


may be used for the stimulus value of the prize. The quantities 
ad 2 0 and ¢ are constants and a is defined by expression (7). 

For the contribution due to the individual’s attitude toward sit- 
uation we shall add a quantity of the form 


Ab= Ab, (a®’ — a), (8) 


where b may be either positive or negative, depending upon whether 
the situation is interpreted as pleasant or unpleasant. 
If we add either terms of (7) or (8) to expression (6), and make 
the generalization indicated above, we obtain the expression: 
1+ yw yu — pf Lit yw =a 


lo = 
a log agi + (1+ B) (1—a)log peas 


4 (9) 
+ alog(1+— ) +b=0, 
Ww 


from which we can obtain f(v, w, a). 
From equation (8) we can see that ifa=0,f=0,anda=1 - 
then f = v. Now suppose that a = a and b = b, , except for a = 0 

and 1, and let w >> 1/y, v << w and a << 1. We shall define 

f 
k=—. 

a (10) 
Then, expanding (9), we have (after rearrangement) : 
ig tt +a/at+ bw/av v(a?+afp+o? 8B +a—<a?) 


1+f—afB 2iatl -ipio~ ee) Peiee Abie a 
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FIGURE 36 


In Figure 3 some examples of the types of relations which may 
occur for selected values of the parameters are graphically shown. 
Except for the case of Figure 3b, the results are obtained from the 
simplified expression (11). From these graphs it can be seen that, 
depending upon the choice of parameters, rather different types of 
curves can be obtained. 

In order to ascertain what types of relations might actually oc- 
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FIGURE 3c 


a= 0.00I 
v << 0.0lw 
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FIGURE 3d 


cur, a number of subjects were given an opportunity to offer various 
amounts for various prizes ($1, $10, $100), in situations in which the 
probabilities (10-* to 3/4) were stated. The subject was given a state- 
ment of the prize v and probability o for each situation; then a restate- 
ment in different terms. For example: “Prize—$10, chances 1 in 10; if 
each of ten individuals contributes $1 to a total prize of $10, the most 
I would pay for a chance to win is ————.” Then followed a set of 
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values. The subject’ was instructed to proceed from the low values un- 
til he reached the last value he would offer for a chance at the prize. 
This value was recorded by the subject to avoid possible psychologi- 
cal pressure on him by the experimenter. Most subjects were told 
that the experimenter reserved the right, within the day, to expect 
the subject to accept an offer for any one of the situations at the 
price recorded by the subject. For 26 subjects (18 male, 8 female) 
the set of questions was given three times, usually on different days. 
For some individuals the results were very uniform; for others the 
results were highly erratic. For this reason the result of averaging 
was occasionally ambiguous. Wherever the mean was close to the me- 
dian, the former was used; otherwise, the mean and median were 
averaged. If the subject was unwilling to offer one cent, he was given 
an opportunity to give the number of chances for which he would pay 
one cent. pong iad ea. 

The results from these subjects indicated that there are perhaps 
four primary types of curves. In one group the relation between K 
and « is one in which there is a continuous decrease in Fk with a up 
to a value of 3/4. The slopes and the positions of the curves vary 
from individual to individual. Six of the twenty-five subjects were in 
this group. An example is shown in Figure 4b. In some cases two of 
the curves (e.g., v = $1 and v = $10) would almost overlap. 

A more or less pronounced minimum in the curves appeared in 
a second group. It was less pronounced and often absent for v= 1. 
Figure 4a shows an example of this group. In several cases the mini- 
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mum is much more pronounced than in the example shown. In gen- 
eral, the order in which the curves occur is the same as in Figure 4b, 
although, for example, the curves for v = 1 and v = 10 would over- 
lap or even invert for one individual. The curves for v = 1 and v = 
100, however, were never inverted. Out of the twenty-five cases, seven 
would best fall into this class. 


Since the values of a close to unity were not used, it is not pos- 
sible to determine whether or not the above groups differ only in the 
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position of the minimum. Furthermore, irregularities in the curves 
often make the existence of a minimum uncertain. Thus six subjects 
could be equally well classified into group one or two. 

A third group, in which there were four cases, is illustrated by 
the example in Figure 4c. The feature characterizing this group is 
the decrease in R with decreasing a. In one case the curves for 
v=1,v=10, and v = 100 were practically superimposed. 

Two other subjects could be characterized by their R values be- 
ing roughly constant. One was very erratic, the other quite regular. 
The results for the latter individual are shown in Figure 4d. It may 
be that this case is more similar to the case shown in Figure 3d. 

There was no evidence to suggest a sex difference in the type of 
response. 

Since there are such great differences in the responses in the 
different groups, averaging, depending on the method used, gives 
very different results. An arithmetic average results in a curve for 
the ratio R which is greater than unity for small values of a and 
smaller than unity for values of a greater than about 10-* (Preston 
and Baratta, 1948). 

Of eighteen other individuals who were given the questionnaire 
only once, three made no offers, four made offers very much smaller 
than any in the above groups, and three could be fairly satisfactorily 
classified into group one, seven into group two, one into the fourth 
group and one resembled Figure 3d. 
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No offers were actually made to any subject at the time the ques- 
tionnaires were distributed. However, each of seven of the subjects 
was offered a chance at the median value of his own offers for a se- 
lected value of a and v. Value of a from 1/100 to 1/2 and values of v 
of $1 and $10 were used. The offers were accepted in all seven cases. 
The subjects on the whole were neither anxious to accept nor reject 
the offers . : corse 

The Determination of g(L, w, a): Insurance against Loss. We 
may next ask what amount g would be given in exchange for security 
against a loss L. If the amount g is given out, the result will be 


A log(1 + yw— yg) =Aaclog(1 + yw— yg) 
+ A (1 —» log Dyed J 


Otherwise, if a is now the probability. of loss, it would be 
Aalog(1 + yw— yL) + A(1—a)log(1 + yw). 


The prospect of loss may also act as an unpleasant stimulus. If 
we consider that its effect is just the same as a negative prize, then, 
the effect would be the same, except that v is replaced by — L. In 
the previous case the prize could be anticipated in the immediate 
future, while in this case the loss may occur in the distant future at 
an uncertain time. For this reason a and b are primed. Thus we have 
the following expression from which to determine g(L, w,a): 

ye) 


1+ yw—yg 1 
Se a : 
Loe yt 9 L, ( nae woes 1+ yw 


—a log(1— L/w) — b'=0. 


a log 
(12) 


The £’ is introduced somewhat arbitrarily, as in (6). Since a has a 
different meaning here, and since the time of occurrence of the loss 
is uncertain in this case, 6 may be quite different; hence it is primed. 
As in the previous case, a more detailed study is required to evaluate 


If a= 0, then, from (11), g =0.If a=1, then gy = L. For 
yw >>1,a<<1,L << w, we find approximately the same expres- 
sion for g/aL as for f/av of (11) ; however v is replaced my minus 
L. This makes g/aL greater than unity for large L/w and for small 
a. Unless L/w is small, a is not too small, and f’ is small, g/L will 
exceed a . The conditions for which g/aL will exceed unity are, there- 
fore, more likely to be met in this case than in the previous case of 


f/av. 
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It would be of interest to relate the parameters in equation (12) 
with those of equation (9). A more detailed study would first be re- 
quired to elaborate upon the quantities which have been replaced by 
the parameters 6 and f’. One then might be in a position to predict 
correlations between the parameters. 
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JAMES T. CULBERTSON. Consciousness and Behavior: A Neural Analysis of Be- 
havior and Consciousness. 1950. 210 pp. Dubuque, Ia.: Wm. C. Brown Co. $4.25. 


Whether or not the so-called “digital” approach to the central nervous sys- 
tem will become a well-beaten path in theoretical physiology will be determined 
among other things by the exploration of its implications. This job is undertaken 
with remarkable thoroughness by James T. Culbertson of the School of Philos- 
ophy at the University of Southern California in his book Consciousness and Be- 
havior. 

The digital model of the central nervous system was first rigorously de- 
scribed in this journal by W. S. McCulloch and W. Pitts (Bull. Math. Biophysics, 
5, 115-83) and has undergone considerable theoretical development by N. Rash- 
evsky, H. D. Landahl, J. Roberts, and others. 

By theoretical development of a model we mean an investigation of what the 
model] can do. Part one of Dr. Culbertson’s volume is devoted to such an inves- 
tigation. He shows, for example, the construction of digital type neural nets which 
give invariant responses to “cardinal number,” i.e., responses where the stimula- 
tion of a certain number of elements results in a firing of a particular neuron, 
regardless of which elements were stimulated. Next the “bottle neck” problem 
is solved in terms of the digital net. It is shown how one-to-one mapping from one 
set of neurons to another can be accomplished even though the number of con- 
necting fibers between the sets is smaller than the number of neurons to be map- 
ped. General questions concerning the transmission of “information” in digital 
nets are raised in relation to this specific problem, which originally came to the 
attention of neuro-physiologists in connection with the mapping of the visual 
retina upon the striate area. 

Further, Dr. Culbertson treats the problem of gestalt in connection with the 
recognition of contour shapes and proposes a model somewhat similar to but not 
identical with that of Pitts and McCulloch (Bull. Math. Biophysics, 9, 127-47). 
It is also considerably more specific and detailed than the Pitts-McCulloch model. 

Finally the problem of long-term detailed memory is treated on the basis of 
“synaptic variation,” a formal postulate implying that end bulbs, previously in- 
effective in eliciting the firing of the neuron on which they synapse, become ef- 
fective if, when they are active, the neuron is fired by other effective bulbs. This 
postulate has also been proposed by D. O. Hebb (Organization of Behavior), by 
A. Shimbel (Bull. Math. Biophysics, 12, 241-75), and by others. The advantage 
of this hypothesis is that it obviates the necessity of explaining learning and con- 
ditioning processes in terms of reverberating circuits and thus is “neuro-econom- 
ical,’ to use Dr. Culbertson’s term. The synaptic variation hypothesis also seems 
more reasonable to some authors (e.g., Hebb) in accounting for long range learn- 
ing and memory processes than reverberating circuits, which are believed to be 
active only for short periods of time. The formal implications of the sypnatic 
variation hypothesis are, however, entirely analogous to those derived from rever- 
berating circuits in the conditioning process. 

Part two of Consciousness and Behavior is devoted to a theory of conscious- 
ness and sensation, which Dr. Culbertson prefers to treat separately from the 
theory of behavior. The philosophical reasons for this Cartesian dichotomy are 
stated by the author in a futuristic skit in which Troglodyte Titchno, a leader of 
a small rebellious band of “neo-structuralists,” shouts defiance at his captors, 
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apparently the entrenched intellectual rulers of 2000 A.D., who have succeeded in 
banning all considerations of consciousness from Official Psychology. 

The polemic is spirited, and the theory of psycho-spaces proposed by Dr. 
Culbertson in his support of the neo-structuralist thesis is original and provoca- 
tive but entirely beyond the scope of this journal. 

Some quaint effects result from Dr. Culbertson’s use of integers instead of 
algebraic symbols to designate numbers of neurons involved. Thus we read ex- 
pressions like this: “If 264 cells fire in P and 746 cells fire in Q.... .” This 
usage is, however, in harmony with Dr. Culbertson’s extremely explicit treatment 
of his models. In this reviewer’s opinion, this minor eccentricity adds to rather 
than detracts from the general clarity of treatment and the unique charm of 
Dr. Culbertson’s style, which sets this book apart among philosophical disser- 
tations. 

ANATOL RAPOPORT 
The University of Chicago 
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